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the  real  numbers, 
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Dirac  delta  function. 
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pseudo-inverse  of  the  matrix  A  . 

orthogonal  complement  of  the  matrix  A  . 

transpose  of  the  matrix  A  . 

complex  conjugate  transpose  of  the  matrix  A  . 

the  matrix  A  is  positive  deflnite. 

the  matrix  A  is  positive  semi-definite. 

the  i‘*  eigenvalue  of  the  matrix  A  . 

the  ilk  singular  value  of  the  matrix  A  . 

the  maximum  singular  value  of  the  matrix  A  . 

the  kernel  of  the  matrix  A  when  viewed  as  a  linear  operator. 

the  trace  of  the  matrix  A  . 

Frobenius  norm  of  the  matrix  A  . 


R 


proper,  real-rational  functions. 
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When  R  is  used  as  a  prefix,  it  denotes  real-rational.  The  superscripts  ”n”  and  V  xm*  (as  in  Ft 
and  Ft*  * “  )  will  denote  the  corresponding  n -vectors  and  p  Xm  matrices. 


Abbreviations 


rhp 

right  half  plane  of  complex  number  plane. 

Ihp 

left  half  plane  of  complex  number  plane. 

ref 

right  coprime  factorization. 

lef 

left  coprime  factorization. 

S1SO 

single-input/single-output. 

MIMO 

multiple-input/multiple-output. 

LQG 

linear-quad  ratic-Gaussian. 

WHKB 

Wiener-Hopf-Kalman-Bucy. 

LFT 

linear  fractional  transformation. 

GEP 

generalized  eigenvalue  problem. 

10F 

inner-outer  factorization. 

CIF 

complementary  inner  factor. 

ARE 

algebraic  Riccati  equation. 

GDP 

general  distance  problem. 
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CHAPTER  1 
INTRODUCTION 

This  report  presents  some  new  methods  for  analysis  and  synthesis  of  control  systems  for  robust  per* 
formance  in  the  presence  of  structured  uncertainty.  It  builds  on  the  results  of  Doyle  (1984).  The  techni¬ 
cal  approach  involves  the  structured  singular  value,  p,  as  an  analysis  tool  and  Hx  as  a  synthesis  tool. 
These  are  combined  to  form  the  basis  for  p-synthesis. 

The  major  contributions  of  this  report  with  their  corresponding  chapters  are: 

(1)  A  comparison  is  given  of  H&,  performance  and  robustness  formulations  with  some  alternatives 
using  other  norms.  Performance  for  bounded  magnitude  time  signals  are  found  in  terms  of 
H a,,  norms  on  transfer  functions.  (  1  ) 

(2)  Extensions  of  p  to  handle  real  parameter  variations  are  considered.  Improved  bounds  are 
obtained  for  this  problem.  (  1  ) 

(3)  The  issue  of  convergence  of  p-synthesis  to  a  global  optimum  is  studied.  It  is  shown  that  the 
global  solution  to  the  p-synthesis  problem  can  be  found  in  the  constant  (or  equivalently 
acausal)  case.  This  provides  useful  information  for  the  general  case  and  is  encouraging  regard¬ 
ing  the  prospects  of  obtaining  similar  results  there.  (  1  ) 

(4)  A  comprehensive  and  unified  treatment  is  given  to  the  synthesis  of  general  optimal  controllers 
for  linear  lumped  time-invariant  systems.  (  2  ) 

(5)  The  existence  of  an  optimal  solution  for  the  general  //^-optimization  formulation  is  proven. 
In  particular,  it  is  shown  that  there  exists  a  real-rational  optimal  solution  when  the  original 
data  is  real-rational.  (  3  ) 

(6)  An  iterative  procedure,  called  ^-iteration,  is  discussed  which  reduces  the  general  distance  for¬ 
mulation  to  an  equivalent  solvable  best  approximation  formulation.  Some  tight  and  comput- 


able  bounds  are  derived  and  properties  which  guarantee  rapid  convergence  of  the  iteration  are 
established.  (  4  ) 

(7)  A  new  and  efficient  algorithm  for  solving  a  class  of  algebraic  Riccati  equations  that  arises  in 
the  7-iteration  is  obtained.  (  5  ) 

(8)  State-space  representations  and  reliable  algorithms  are  derived  in  parallel  to  the  development 
of  the  other  theory  throughout  this  work.  (  5  ) 

(9)  Explicit  error  bounds  are  obtained  which  make  model  reduction  in  the  synthesis  process  possi¬ 
ble  and  simplify  the  complexity  of  the  controller  significantly.  (  6  ) 

Based  on  the  results  in  this  report  and  in  Doyle{1984),  an  experimental  software  package  has  been 
developed  which  uses  only  standard  matrix  operations  and  linear  algebra  techniques.  Several  example 
designs  have  been  performed  which  have  been  very  successful  and  moat  encouraging.  These  example 
designs  will  be  presented  elsewhere.  Many  of  the  results  presented  in  this  report  have  appeared  in  papers 
by  the  authors. 


1.1  General  Analysis  Framework 


This  chapter  will  review  some  basic  methods  for  analyzing  the  performance  and  robustness  properties 
of  feedback  systems.  The  particular  approach  taken  here  is  from  |D14j-|D19j  and  [C2j-(C4|  which  builds  on 
results  by  many  other  researchers.  In  this  context,  analysis  refers  to  the  process  of  determining  whether  a 
system  with  a  given  controller  has  desired  characteristics,  whereas  synthesis  refers  to  the  process  of  finding 
a  controller  that  gives  desired  characteristics,  usually  expressed  in  terms  of  some  analysis  method.  This  is 
the  fairly  standard  usage  of  these  terms  in  the  control  community.  It  should  be  obvious  that  the  question 
of  analysis  must  be  settled  to  some  degree  before  a  reasonable  synthesis  problem  can  be  posed.  The  formal 
analysis  and  synthesis  techniques  discussed  in  this  report  are  only  some  of  the  methods  that  might  make  up 
the  overall  process  of  engineering  design. 

The  general  framework  to  be  used  in  this  report  is  illustrated  in  the  diagram  in  Figure  1.1.  Any  linear 
interconnection  of  inputs,  outputs,  commands,  perturbations,  and  a  controller  can  be  rearranged  to  match 
this  diagram.  For  the  purpose  of  analysis  the  controller  may  be  thought  of  as  just  another  system  component 
and  the  diagram  reduces  to  that  in  Figure  1.2.  The  analysis  problem  involves  determining  whether  the  error 
e  remains  in  a  desired  set  for  sets  of  inputs  v  and  perturbations  A.  Analysis  methods  differ  on  the  description 
of  these  sets  and  the  assumptions  on  the  interconnection  structure  G.  In  this  report  G  will  be  taken  to  be 
a  linear,  time-invariant,  lumped  system  and  be  represented  by  a  rational  transfer  function.  The  convolution 
kernel  associated  with  G  will  be  denoted  as  g,  so  G  is  a  real-rational  matrix  function  of  a  complex  variable 
and  g  is  a  matrix  function  of  time.  The  interconnection  structure  G  can  be  partitioned  so  that  the  transfer 
function  from  v  to  e  can  be  expressed  as  the  linear  fractional  transformation 

e  s*  F%{G,  A)  v  =  [Gjj  +  Gj|A(/  —  GhA)-1  Gu]  v. 

The  external  input  v  is  an  additive  signal  entering  the  system  and  is  typically  used  to  model  disturbances, 
commands,  and  noise.  The  alternative  descriptions  of  the  sets  to  which  v  is  assumed  to  belong  and  the 
corresponding  performance  requirements  on  e  will  be  considered  in  the  next  section.  It  is  generally  inadequate 
in  modeling  systems  for  control  design  to  consider  uncertainty  only  in  the  form  of  uncertain  additive  signals 
[H4|.  The  system  model  itself  typically  has  uncertainty  which  can  have  a  significant  impact  on  system 
performance.  This  uncertainty  is  a  consequence  of  unmodeled  dynamics  and  parameter  variations  and  is 
modeled  as  the  perturbations  A  to  the  nominal  interconnection  structure  G.  Note  that  the  uncertainty 
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modeled  u  A  has  a  very  different  effect  from  that  of  v  on  the  performance  of  the  system.  For  example, 
perturbations  can  cause  a  nominally  stable  system  to  become  unstable,  which  v  cannot  do.  Techniques  for 
modeling  plant  perturbations  and  analyzing  their  impact  on  stability  will  be  considered  in  Section  3. 

At  the  heart  of  any  theory  about  control  are  the  assumptions  made  about  G,  v  and  A,  as  well  as  the 
performance  specifications  on  e.  These  assumptions  determine  the  analysis  methods  which  can  be  applied 
to  obtain  conclusions  about  system  performance.  A  desirable  objective  is  to  make  weak  assumptions  but 
still  arrive  at  strong  conclusions  and  the  inevitable  tradeoff  implied  by  this  objective  drives  the  development 
of  new  theory.  The  control  theoreticians  role  may  be  viewed  as  one  of  developing  methods  that  allow  the 
control  engineer  to  make  assumptions  which  seem  relatively  natural  and  physically  motivated.  The  ultimate 
question  of  the  applicability  of  any  mathematical  technique  to  a  specific  physical  problem  will  always  require 
a  “leap  of  faith"  on  the  part  of  the  engineer  and  the  theoretician  can  only  hope  to  make  this  leap  smaller. 

The  methods  in  this  chapter  provide  a  powerful  set  of  techniques  for  modeling  and  analyzing  uncertain 
systems.  To  provide  a  context  for  these  methods  consider  an  alternative  analysis  technique  based  on  simu¬ 
lation.  Modern  simulation  methods  allow  for  a  wide  variety  of  plant  models  and  performance  specifications 
and  as  a  consequence  are  an  essential  part  of  the  control  engineers  toolkit.  The  price  paid  for  this  flexibility 
is  that  the  system  inputs  and  perturbations  can  be  considered  only  one  at  a  time.  This  chapter  focuses  on 
describing  teti  of  signals  and  perturbations  and  drawing  conclusions  on  worst-case  performance  for  entire 
sets  at  once.  An  important  issue  to  be  addressed  here  is  the  description  of  sets  which  adequately  model 
physical  phenomena.  It  can  be  just  as  limiting  to  have  models  which  allow  signals  or  perturbations  which 
have  no  physical  motivation  but  severely  impact  performance  (of  the  model)  as  it  is  to  have  models  that 
ignore  critical  physical  phenomena.  The  methods  in  this  chapter  are  aimed  at  allowing  for  signal  and  pertur¬ 
bation  models  that  naturally  match  the  physical  phenomena  that  the  models  are  intended  to  represent.  The 
price  paid  for  this  flexibility,  in  contrast  with  simulation,  is  that  very  restrictive  assumptions  ,  in  particular 
linearity,  must  be  made  about  the  nominal  interconnection  structure.  Nonlinearities  can  only  be  handled  as 
perturbations. 

It  is  beyond  the  scope  of  this  report  to  give  a  thorough  discussion  of  the  relationship  between  models 
and  the  physical  systems  they  represent.  Attention  will  be  given  to  the  main  assumptions  that  have  been 
proven  useful  in  practice,  and  some  comparison  of  the  alternatives.  The  particular  focus  of  this  chapter  is 
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on  developing  techniques  that  allow  very  precise  analysis  of  systems  which  have  fairly  standard  performance 
requirements  and  uncertainty  models  in  terms  of  additive  noise  and  plant  perturbations.  While  the  “best’’ 
assumptions  for  engineering  purposes  will  always  be  a  matter  of  debate,  it  is  clear  that  for  any  given  set 
of  assumptions  it  is  desirable  to  have  very  precise  analysis  techniques.  The  ideal  would  be  necessary  and 
sufficient  conditions  for  the  satisfaction  of  a  performance  specification  in  the  presence  of  sets  of  inputs  and 
perturbations.  Additionally,  the  conditions  should  be  computable  or  should  at  least  yield  bounds  which 
give  useful  estimates  of  system  performance.  With  such  methods,  the  engineer  can  focus  directly  on  the 
relationship  between  uncertainty  assumptions  and  system  performance  without  worrying  about  potential 
gaps  caused  by  inadequate  analysis  techniques. 

The  assumptions  about  v  and  the  system  performance  specifications  on  e  are  considered  in  Section  2.  A 
basic  requirement  is  that  the  nominal  system  (A  =  0)  be  stable.  Recall  that  this  is  analysis  of  the  closed-loop 
system  with  controller  in  place  so  it  is  assumed  that  G  is  stable,  which  will  be  taken  to  mean  having  no 
rhp  poles.  Performance  will  be  expressed  in  terms  of  *  being  contained  in  a  specified  set  bounded  in  power, 
energy,  or  magnitude.  In  Section  2  only  uncertainty  in  v  will  be  considered  so  this  will  be  referred  to  as 
nominal  performance  to  indicate  that  A  =  0.  Nominal  performance  will  be  seen  to  be  equivalent  to  a  norm 
test  on  Gjj.  The  main  focus  of  this  report  will  be  on  ||  •  ||oo.  but  other  norms  of  practical  interest  will  be 
considered  and  briefly  compared.  It  will  be  argued  that  the  ||  •  ||oo  norm  is  a  useful  and  flexible  norm  for 
studying  performance. 

Section  3  considers  stability  in  the  presence  of  perturbations.  Thu  will  be  referred  to  as  robait  >t  ability 
with  robust  used  here  to  indicate  that  the  property  of  stability  is  maintained  under  perturbations.  For  simple 
unstructured  perturbations,  this  also  leads  naturally  to  a  |] •  ||oo  norm  test,  but  now  on  Gu.  The  || •  [|oo  norm 
thus  provides  a  single  norm  which  handles  both  nominal  performance  and  robust  stability.  Unfortunately, 
norm  bounds  are  inadequate  in  dealing  with  more  realistic  models  of  plant  uncertainty  involving  structure 
and  more  complicated  mathematical  objects  involving  the  structured  singular  value,  p,  are  required. 

The  methods  outlined  in  Sections  2  and  3  allow  for  assessing  either  nominal  performance  or  robust 
stability.  Obviously,  it  would  be  desirable  to  treat  performance  with  both  noise  and  perturbations  occurring 
simultaneously.  Section  3  concludes  with  this  problem  and  shows  that  this  also  leads  to  tests  using  /i,  but 
now  involving  the  entire  transfer  function  G.  Thus  p  emerges  as  an  essential  analysis  tool  in  dealing  with 
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nb%tt  perfontunee  u  well  as  with  structured  perturbations. 

Section  4  briefly  reviews  Hoc  ud  Hi  optimal  control.  The  H^  methods  combine  with  the  properties  of 
ft  discussed  in  Section  5  to  form  the  basis  for  ji-synthesis.  Chapters  2-7  of  this  report  expand  on  the  theory 
outlined  in  Section  4. 

The  mathematical  properties  and  computation  of  ft  are  briefly  taken  up  in  Sections  5  for  the  case  of 
complex  perturbations  and  6  for  the  real  ease.  Here  ft  is  viewed  as  a  natural  generalization  of  both  spectral 
radius  and  spectral  norm,  and  this  viewpoint  leads  to  useful  characterizations  of  ft  in  terms  of  these  more 
familUr  quantities.  One  consequence  is  that  estimates  for  ft  can  be  obtained  by  scaling  of  ordinary  singular 
values.  The  implications  of  this  approach  for  synthesis  are  also  briefly  considered  in  Section  7. 

The  main  results  of  this  chapter  will  be  expressed  as  theorems  which  are  each  instances  of  the  following 
form  of  a  General  Analysis  Theorem: 


General  Analysis  Theorem  (GAT): 


Performance 


for  all 


Uncertainty 


iff 


Analysis  Test 


As  implied  by  the  form  of  this  “theorem",  this  chapter  will  focus  on  necessary  and  sufficient  conditions 
for  performance  in  the  presence  of  uncertainty.  The  uncertainty  will  be  combinations  of  input  signals, 
perturbations,  and  parameter  variations.  Performance  will  be  simply  stability  or  stability  plus  a  bound  on 
the  error  e.  It  is  hoped  that  by  organizing  the  many  alternative  methods  of  performance  and  robustness 
analysis  in  this  way,  it  will  be  easier  to  compare  the  assumptions  and  their  relative  merits. 


1.3  Nominal  Performance 


This  section  considers  performance  in  terms  of  bounds  on  e  in  the  presence  of  uncertain  bounded  inputs 
v.  Bounds  for  both  v  and  t  are  expressed  in  terms  of  signal  power,  energy,  or  magnitude.  Such  descriptions 
are  standard  within  the  control  theory  community  and  a  rigorous  treatment  will  not  be  given  here.  The  focus 
is  on  comparing  the  resulting  tests  on  G  implied  by  the  alternative  assumptions.  While  other  assumptions 
on  signals  could  be  considered,  these  are  the  most  common  both  in  the  literature  and  in  practice.  To  simplify 
the  comparisons,  assume  temporarily  that  the  signals  are  scalars  so  that  spatial  norms  are  not  an  issue.  Thu 
will  focus  attention  on  the  contrasts  between  the  alternative  descriptions  of  the  time  content  of  the  norms, 
which  is  far  more  significant. 

The  three  alternative  assumptions  about  the  signals  is  that  they  are  bounded  in  “average  power”,  “total 
energy”,  or  magnitude.  The  terms  power  and  energy  are  used  here  in  a  generalized  sense  to  indicate  that 
integrals  of  the  square  of  the  signals  are  involved.  This  is  standard  usage  of  these  terms  within  the  control 
community.  Suppose  that  v  is  a  function  of  time  snch  that  on  any  finite  interval  it  is  square  integrable.  Then 
we  may  obtain  bounds  on  v  in  terms  of: 

(1)  Power  BP  =  je  j  jlimo  ^  J  ^  |v(*)|*  dt  <  1 

(2)  Energy:  BL ,  =|n|lH|*  -J  |v(f)|*  dt  <  1 

(3)  Magnitude:  BLoo  =  j  v  |  ||i'||00  =  esssup|v(<}|  <  1 

The  prefix  B  denotes  the  unit  balL  The  bounds  are  scaled  to  1  since  any  other  scaling  can  simply  be 
absorbed  into  the  interconnection  structure  G.  Likewise,  any  weighting  or  coloring  filter  can  be  absorbed 
into  G  so  that  only  unweighted  signals  need  be  considered.  Note  that  in  practice,  the  use  of  weightings  on 
both  v  and  e  are  essential  to  reflect  the  varying  spatial  and  frequency  content  of  both  the  input  signals  and 
the  performance  specifications.  For  simplicity,  all  signals  will  be  assumed  to  be  complex. 


For  nominal  performance  the  GAT  takes  the  form: 


GAT  for  Nominal  Performance: 


e  €  Performance  Set 


for  all 


v  €  Input  Set 


iff 


Norm  Test 


Here  the  performance  and  input  sets  are  taken  to  be  either  BP,  BL a,  or  BLoo  as  defined  above  with 
the  additional  input  set  of  sinusoids  also  being  considered.  Table  1  gives  the  norm  test  for  each  combination 
that  makes  sense  where 

If00  \ 1,7 

—  y  trace  [G(ju)*  G(ju) J  duj 


||G||oo=  sup  y(G(;w)j 

W 

iwu-  r  woi*. 

Jo 

To  iiCjjoo  norm  is  defined  for  matrices  since  this  will  be  used  in  the  remaining  sections. 


Table  1.  Performance  Summery 


e 

V 

Power 

Energy 

Magnitude 

Power 

Halloo 

00 

00 

Energy 

D 

l|G|U 

INIs 

Magnitude 

IICIU 

00 

lltflll 

Sinusoids 

IIClIoo 

00 

INI* 

In  each  case  the  norm  test  is  the  indicated  norm  being  less  than  or  equal  to  one;  the  norm  would  be 
applied  to  ( the  subscript  is  dropped  in  the  table).  For  example,  the  upper  left  band  corner  case  would 
yield  the  following  version  of  the  GAT: 
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Performance  Theorem  (Power/Power): 

e  6  BP  V  v  €  BP 

iff  l!C n  Hoc  S  1* 

Some  of  the  entries  in  Tnble  1  are  either  0  or  oo.  The  0  occurs  because  tor  any  stable  G,  v  €  BLi  yields 
e  of  zero  average  power.  The  oo  occurs  because  for  the  indicated  combinations  any  G  jt  0  allows  unbounded 
outputs  tor  some  bounded  inputs.  These  combinations  are  dearly  of  no  practical  or  theoretical  interest. 

Note  that  ||C||oo  is  the  clearly  the  most  common  norm  in  Table  1  and  will  be  the  norm  used  in  the 
rest  of  this  paper.  It  results  from  several  different  assumptions  of  practical  and  theoretical  interest.  Perhaps 
the  most  useful  are  v  g  BP  and  e  €  BP  and  the  cases  of  sinusoidal  inputs.  The  vector  case  is  no  different 
provided  the  spatial  part  of  the  vector  norm  in  each  case  is  taken  to  be  the  usual  Euclidean  2-norm.  The 
||C>||]  and  ||p||i  norms  each  appear  only  once.  The  ||G||j  appears  for  v  G  BLi  and  e  G  BL <».  While  this 
appears  to  be  a  strange  combination,  it  could  have  some  significance  in  some  problems.  To  my  knowledge, 
no  one  has  described  control  problems  in  this  way. 

The  standard  assumptions  which  lead  to  ||G||s  are  not  covered  in  the  table  but  concern  situations  where 
either  v  has  a  fixed  power  spectrum  and  e  G  BP  or  p  is  a  stochastic  process  with  fixed  power  spectral 
density  function  and  performance  is  measured  in  terms  of  the  variance  of  e.  The  ||G||j  also  arises  when  v 
is  a  fixed  signal  and  e  6  BLt.  Zames  and  others  (|Z2],[Z3)}  have  argued  that  these  assumptions  are  often 
not  appropriate  for  control  problems.  At  the  heart  of  this  argument  is  the  observation  that  when  frequency- 
dependent  weights  are  use  to  shape  the  spectral  content  of  signals  and  performance  specifications,  it  may  be 
a  better  model  of  physical  reality  to  view  inputs  as  being,  for  example,  bounded  in  power  than  characterized 
by  a  perfectly  known,  fixed  power  spectrum.  In  addition,  it  is  usually  quite  easy  to  find  performance  weights 
that  turn  a  problem  specified  in  terms  of  ||  •  ||j  into  one  involving  only  ||  •  ||oo-  While  this  would  be  of 
no  particular  value  when  only  uncertain  inputs  are  considered,  it  could  prove  quite  useful,  as  will  be  seen, 
when  uncertainty  in  the  form  of  plant  perturbations  are  included.  While  these  issues  still  remain  largely 
unresolved,  it  is  clear  that  the  popularity  of  ]|  •  ||j  is  due  in  great  part  to  its  convenient  mathematical 
properties,  which  are  substantial  and  well-known.  Fortunately,  this  distinction  between  ||  •  ||j  and  ||  •  ||oo  » 
becoming  less  significant. 
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The  jjfflji  norm  cannot  be  dismissed  as  lightly.  The  assumptions  which  lead  to  it,  that  both  v  and  t 
are  in  Sloe.  are  very  appealing.  It  is  often  the  case  in  practice  that  the  critical  issue  is  the  magnitude 
of  signals  and  not  their  power  or  energy.  In  fact,  it  could  be  argued  that  this  would  be  the  obvious  norm 
of  choice  for  most  engineering  problems  were  it  not  for  the  mathematical  difficulties  associated  with  ||p||i. 
For  example,  it  seems  more  difficult  to  shape  the  spectral  content  of  signals  in  BLao  using  weights  than  for 
signals  assumed  to  be  in  BP.  This  is  of  great  practical  significance  since  it  is  typically  critical  in  achieving 
good  designs  to  take  advantage  of  what  is  known  abont  the  frequency  content  of  signals.  In  addition,  ||^ || t  is 
very  difficult  to  work  with  analytically.  It  has  no  useful  sets  of  invariants  analogous  to  the  inner  or  all-pass 
functions  for  ||G||a  and  ||Cr||oo>  There  is  no  synthesis  methodology  for  optimising  ||p||,  except  in  very  special 
cases  [Pearson). 

Given  the  difficulties  associated  with  directly  synthesizing  for  ||p||i  and  its  potential  practical  impor¬ 
tance,  it  is  interesting  to  ask  bow  |jp||i  relates  to  l|G||eo.  In  particular,  it  is  important  to  know  that  optimizing 

||£||oo  W>11  not  do  great  violence  to  ||p||i.  Since  the  constant  term  in  G  simply  adds  to  ||p||i,  suppose  for  the 

moment  that  G  is  strictly  proper.  It  can  be  shown  that 

Halloo  <  II0II1  <  (2n  +  l)||G||oo  (2.1) 

where  n  is  the  McMillan  degree  of  G.  The  left  hand  bound  is  well  known  and  the  right  hand  side  is  proven 
at  the  end  of  this  section.  In  fact,  a  stronger  result  is  proven,  namely  that 

IMI.  <*!><  (2.2) 

1 

where  {*, }  are  the  singular  values  of  the  Hankel  operator  associated  with  G.  Given  that  the  response  of 
many  systems  can  be  approximated  at  least  crudely  by  fairly  low  order  systems,  this  bound  suggests  that 
||G(Joo  may  often  be  a  reasonable  approximation  to  ||{?||t. 

It  should  be  noted  that  examples  can  be  constructed  for  any  rt  so  that  all  the  bounds  in  (2.1)  and  (2.2) 
are  achieved.  The  examples  that  achieve  the  upper  bounds  are  pathological  and  require  G  to  have  poles  and 
zeros  widely  spread  and  the  inputs  to  have  very  broad  spectral  content.  This  suggests  that  even  more  useful 
bounds  may  be  obtained  when  additional  assumptions  such  as  restrictions  on  pole  and  zero  locations  and 
input  signal  bandwidth  are  imposed.  This  is  clearly  an  area  that  will  require  additional  research. 
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Proof  of  bound  ]|fl||i  <  2n)]G|!oo  •  (  the  idea  for  this  proof  is  due  to  I.  Gohberg) 

Let  G(s)  =  e(»/  -  A) ~ 1 6,  where  (A,b,c)  is  an  internally  balanced  minimal  realization  (Moore)  so  that 

f  ff*  ‘  e'e^dt  =  f  bb'e*  ldt 

J  o  J  o 

=  =  diag(*i,*s, •••,»»)>  0.  (2.3) 

Let  <M0  =  ^  e'  e^‘6  and  0*(O  =  --L  c  e^'e,  where  e*  is  the  i**  unit  vector.  Note  that  (2.3)  implies  that 
||*||t  =  IIV’.lls  =  1.  Expanding  g{t)  =  c  e^'6  =  £<  ffj  V»i(|)  di(|)  yields 


<  2  52  J  ItMO  ^•('■)|* 


<  2$!  *< 


Furthermore,  since  ||G||oo  >  <r,  ,  Vi ,  Htfll,  <  2njjG|)  OO  * 
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1.3  Robust  Stability  and  Robust  Pcrformanca 

la  this  section,  we  will  consider  plant  perturbations,  a  type  of  uncertainty  entirely  different  from  uncer¬ 
tain  input  signals.  Since  plant  perturbations  can  destabilize  a  nominally  stable  system,  the  first  issue  to  be 
addressed  is  robust  stability.  Robust  performance  will  be  treated  at  the  end  of  this  section.  In  what  follows, 
it  makes  no  difference  whether  A  is  a  constant  complex  ,  rational,  or  real-rational  matrix  so  for  simplicity  it 
will  be  assumed  constant  complex  Stability  will  be  taken  to  mean  that  the  perturbed  system  has  no  closed 
rhp  poles.  Under  these  assumptions,  we  have  the  following  simple  and  well-known  theorem([Zl],(D10|): 

Theorem  RSU  (Robust  Stability,  Unstructured): 

F.(C,A)  stable  V  A  ,y(A)  <  1 

iff  llCjtlloo  <  I- 

While  the  Hslloo  norm  bad  some  reasonable  competition  for  analyzing  nominal  performance,  it  is  clearly 
the  norm  of  choice  for  robustness  analysis.  While  it  is  possible  to  use  other  norms  in  theorems  such  as  the 
above  to  obtain  sufficient  conditions  for  stability,  only  j|  •  j|oo  yields  necessary  and  sufficient  conditions.  The 
only  change  that  can  be  made  and  still  have  iff  is  to  allow  other  spatial  norms.  In  contrast,  ||  •  ||,  norm 
cannot  even  be  used  to  obtain  sufficient  conditions  for  robust  stability. 

The  term  unstructured  refen  to  the  fact  that  A  is  assumed  to  be  bounded  but  otherwise  unknown. 
Typically  weights  are  used  when  modeling  plant  uncertainty  to  reflect  the  frequency  and  spatial  variation  of 
the  perturbations.  These  weights  can  always  be  absorbed  into  the  nominal  interconnection  structure  so  in 
that  sense  it  is  no  loss  of  generality  to  assume  a  uniform  norm  bound  on  A.  It  is  in  the  assumption  that  no 
structural  information  is  available  for  A  that  limits  the  usefulness  of  Theorem  RSU.  In  practical  problems, 
it  is  generally  the  case  that  the  uncertainty  consists  of  parameter  variations  and  multiple  norm-bounded 
perturbations.  Using  only  a  single  norm-bounded  perturbation  for  analysis  is  rarely  adequate.  Parameter 
variations  typically  arise  because  of  uncertain  coefficients  in  differential  equation  models  of  physical  systems 
and  involve  real  scalars.  Norm-bounded  perturbations  often  arise  when  trying  to  capture  the  effect  of 
unmodeled  dynamics  and  are  themselves  dynamic  systems.  This  would  typically  lead  to  norm-bounded 
real-rational  perturbations,  but  for  analysis,  it  is  sufficient  to  instead  consider  constant  complex  matrix 
perturbations. 
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Any  linear  interconnection  of  inputs,  outputs,  transfer  functions,  parameter  variations,  and  perturba¬ 
tions  can  be  rearranged  to  fit  the  diagram  in  figure  2,  where  7(A)  <  1  but  A  is  block-diagonaL  This  is  an 
obvious  consequence  of  the  fact  that  composition  of  linear  fractional  transformations  are  linear  fractional, 
and  it  holds  for  perturbations  to  transfer  functions  as  well  as  elements  of  state-space  realizations.  Reducing 
to  the  uniform  norm  bound  typically  requires  the  absorption  into  the  nominal  interconnection  structure  of 
scalings  and  weights.  Then  A  will  be  a  member  of  a  set  like 


or  its  bounded  subset: 


A  =  {diag  (d|,#3,...,#,n,  A|,  Aj, ....  Aa) 
I  *  €  R,A,  € 


BA 


|A€  A 


(3.1) 


(3.2) 


It  is  possible  to  define  more  general  sets  involving,  for  example,  repeated  perturbations,  and  these  will  be 
considered  in  Section  5.  Nonsquare  perturbations  can  easily  be  handled  in  what  follows  by  augmenting  the 
interconnection  structure  with  rows  or  columns  of  zeros.  It  should  be  noted  that  although  the  block  diagonal 
perturbation  structure  with  square,  uniformly  bounded  blocks  can  be  used  without  loss  of  generality,  it  may 
be  desirable  from  a  computational  point  of  view  to  use  other  structures.  This  particular  structure  is  chosen 
because  it  is  mathematically  general  and  conceptually  elegant. 


Given  A  €  BA  Theorem  RSU  could  be  used  to  obtain  sufficient  conditions  for  robust  stability,  but 
the  test  could  be  arbitrarily  conservative.  That  is,  it  is  easy  to  construct  examples  where  ||Gu||oo  can  be 
made  arbitrarily  large  but  no  A  €  BA  leads  to  instability.  In  order  to  obtain  a  precise  generalization  of 
Theorem  RSU  to  handle  structured  uncertainty,  we  need  the  structured  singular  value,  p  [D13|.  The  positive 
real-valued  function  satisfies  the  property 

det  (/  -  A/A)  0  for  V  A  €  A, 7(A)  <  i 

iff  <  1.  (3.3) 


Note  that  p  is  a  function  of  A/  that  depends  on  the  structure  of  the  A's  in  A.  This  dependency  is  typically 
not  represented  explicitly.  If  ji(A/)  jt  o,  that  is  3A  6  A  such  that  det(7  -  A/A)  =  0,  then 


1 

#*(  A/) 


mm 

a^A 


jff(A)  det(J  -  A/A)  =  0  j 


(3.4) 


Unfortunately,  (3.4  )  is  not  typically  very  useful  in  computing  ft  since  the  implied  optimization  problem  is 
cumbersome  and  can  have  multiple  local  maxima  which  are  not  global.  Computation  of  ft  is  a  complicated 
problem  and  some  results  will  be  given  in  Sections  5  and  6.  For  now,  assume  ft  is  the  function  defined  above. 

With  these  definitions,  the  correct  generalization  of  Theorem  RSU  to  structured  uncertainty  is 

Theorem  RSS  (Robust  Stability,  Structured)  : 

F,(G,  A)  stable  VA  €  BA 

iff  IK»u|Im  S  1 

where 

=f  sop*i[G(;'u>)].  (3.5) 

Note  that  ||G||„  is  not  actually  a  norm,  but  the  notation  is  convenient.  Note  also  that  it  depends  not 
only  on  G  but  also  the  assumed  structure  of  A. 

The  methods  outlined  above  allow  for  analyzing  either  nominal  performance  or  robust  stability.  Obvi¬ 
ously,  it  would  be  desirable  to  treat  performance  with  both  noise  and  perturbations  occuring  simultaneously 
jD14|.  The  following  theorem  addresses  exactly  this  problem.  The  proof  is  given  at  the  end  of  this  section. 

Theorem  RP: 

F.(G,  A)  stable  and  ||F,(G,  A)|too  <  1  VA  6  BA 
iff  ||C||„  <  1 

where  ft  is  taken  w.r.t.  the  structure 

A  =  (A  =  diag  (A,  A,+i)  A  €  A}. 

This  theorem  is  the  real  payoff  for  using  ft.  It  gives  necessary  and  sufficient  conditions  for  robust 
performance  in  the  presence  of  structured  uncertainty.  It's  made  possible  by  the  equivalence  of  performance 
and  robust  stability  when  using  ||  •  ||oo>  The  block  A„+t  may  be  thought  of  loosely  as  a  “performance  block” 
used  to  turn  the  performance  condition  into  a  robust  stability  condition  and  finally  into  a  test  using  ft.  Note 


that  /j  ia  computed  for  the  full  G  and  is  taken  with  respect  to  an  augmented  structure.  The  analysis  results 
presented  in  this  paper  are  summarized  in  Table  2. 

Thble  2.  Analysis  Summary 


Performance 


Perturbation 


e  €  BP 


No  C+  poles  ||^7]j||ao  —  1 


y(A)  <  l 

!K?ll||oo  $  1 

IIgIIm  <  i 

A  €  BA 

Il^n  IU  <  i 

llGIU  <  i 

Proof  of  Theorem  RP:  From  Theorem  RSU, 

IIA(G,A)|U<1 
iff  det(7-F.(G,A)A„+1)*0, 
V*  =  ju  ,  VA„+i  ,  *(A„+i)  <  1 . 


Similarly, 


A(G\A)  is  stable  VA  €  BA 
iff  det(/  -  Gu  A)  ji  0  Vs  =  ju  ,  VA  €  BA  . 


det  (7  -  G A)  =  det  (7  -  G,  A)  det  (7  -  F,  (G,  A)  A.+, ), 
the  result  follows  immediately  from  definition  of  jj  and  II  ■  IL. 
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1.4  Overview  of  Optimal  Synthesis  Theory 

The  previous  sections  showed  how  nominal  performance  and  robust  stability  with  unstructured 
uncertainty  could  be  treated  using  the  H framework.  More  complicated  issues  like  structured  uncer¬ 
tainty  and  robust  performance  require  it.  This  section  contains  a  review  of  the  general  H2  and  //» 
optimal  synthesis  theory  using  the  framework  depicted  in  Figure  1-3.  The  approach  outlined  in  this  sec¬ 
tion  was  developed  in  Doyle  (1983,1984),  Chu  and  Doyle  (1984,1986),  and  Chu  (1986).  Although  the  focus 
of  this  report  is  on  H x  methods,  it  is  useful  to  put  these  newer  methods  in  a  context  which  includes  the 
more  familiar  H2  theory.  The  general  approach  taken  in  this  section  makes  this  easy  since  the  H2  and 
theory  can  be  developed  in  parallel  up  to  the  final  step,  called  the  General  Distance  Problem  (GDP), 
which  is  the  subject  of  the  remainder  of  this  report. 

In  Figure  1-3,  the  transfer  function  matrix  P  is  the  interconnection  structure  from  the  nominal 
model  of  the  system  and  the  transfer  function  matrix  K  is  the  ’’controller”  to  be  designed.  The  variable 
v  consists  of  all  external  inputs,  e  are  the  error  signals  which  are  to  be  regulated,  u  are  the  control 
inputs,  and  y  are  the  measurements.  This  general  framework  covers  all  standard  lumped  linear  time- 
invariant  filtering  and  control  problems.  Attention  will  be  restricted  to  the  lumped  case  so  all  transfer 
functions  will  be  assumed  to  be  rational. 


Partitioning  P  accordingly,  the  closed-loop  transfer  function  matrix  can  be  written  as  the  following 


linear  fractional  transformation  (LFT) 


e  -  Fi{P  ,K)  v  -  (Pn  +  Pl2K{I-PaKYlPa)v 


(4-1) 


where 


P 


6  R 


(»l  *■  f  j)x(«|  *  Wj) 


€  R 


t,  X  I 


(4-2) 


The  H2  and  H <*,  synthesis  problem  is  one  of  finding  a  stabilizing  K  € 


SUch  that  the  perfor¬ 


mance  measure 


1 1  F|  (P  ,K  )  |  |a  for  a  **  2  or  oo 
is  minimized.  For  nontriviality,  assume  that  p  l  >  m2  and  m,  >  p  2. 

The  first  step  is  to  find 


K. 


*u  K* 

K2\  Kn 


€  R 


(*2  ♦  r^xfj,  f 


such  that 


(4-3) 


(4-4) 


F,(  P  ;  F,(  K,  ;<?))-  F,(  T  ;  (?)  -  Tn  +  TnQT2X  6  /W*1*"1  (4-5) 

is  stable  and  affine  for  any  Q  €  RH^,**'*  ■  This  is  the  "Youla  parametrization'  of  all  stabilizing  con¬ 
trollers  and  is  obtained  by  finding  coprime  factorizations  of  P  over  the  ring  of  stable  rationale  and  solving 
a  double  Bezout  identity  to  obtain  the  coefficients  of  K,  (Youla,  Jabr,  and  Bongiorno  (1976),  Desoer,  Liu, 
Murray,  and  Sacks  (1980)  ).  For  simplicity,  the  superscripts  denoting  spatial  dimensions  will  be 
suppressed  in  the  remainder  of  this  section.  All  quantities  are  assumed  to  be  of  compatible  but  otherwise 
arbitrary  dimension. 


We  are  interested  in  a  particular  K,  which  results  in  TX2  and  T 2i  being  inner  and  co-inner  respec¬ 
tively;  that  is,  T&Tl2  -  /  and  T nT ^  **  /  .  This  requires  a  coprime  factorization  with  inner 


numerator.  In  addition,  T ^  and  can  be  found  so  that  ^Tl2  T^j  and 
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LriJ 


are  square  and  inner. 


T |  and  T  .  are  called  complementary  inner  factors  (CIF).  These  factorizations  can  be  carried  out  using 


10 


standard  real  matrix  operations  on  state-space  representations  (  Doyle  (1983,1984),  Chu  and  Doyle  (1984) 
).  Computation  of  state-space  realizations  of  K, ,  T,  T j,  and  from  one  for  P  involves  solving  two 
standard  algebraic  Riccati  equations  (ARE). 

Because  both  the  ||  •  ||2  and  ||  •  1 1^  norms  are  invariant  under  multiplication  by  square  inner 
matrices,  an  alternative  expression  is  possible.  For  any  Q  €  /?//*, 


II  Tu  +  Tl2QT2l  |U 


-  T ^  r2]  (Tu  +  T l2QT 21) 

>«' 

7i. 

• 

- 

‘-T&TnTZ-Q  -TZTnT f 
T{TuT&  T[TttTf  . 

or 

^  12 

P  21  ft  22 


(«) 


where 


(4-7) 


A  state-space  realization  for  R  can  be  obtained  from  a  corresponding  one  for  P  using  the  factoriza¬ 
tions  involved  in  obtaining  K,  (Doyle  (1984).  Chu  (1985)).  In  particular,  this  approach  yields  ao  R  with 
all  its  poles  in  the  open  rhp  ,  i.e.,  R  is  completely  unstable. 

Up  to  equation  4-6  the  H2  and  Hx  problems  can  be  handled  in  parallel  and  the  same  factorization 
techniques  can  be  used  to  reduce  4-3  to  4-6.  It  is  in  minimizing  4-6  that  the  two  cases  differ  substantially. 
Here,  the  a  »  2  case  is  particularly  simple.  Since 

(4-8) 


the  optimal  Q  can  be  found  from 

min  1 1  ft  u  -  Q  I U  —  1 1  ft  n  _  Qipt  I  lz 

q  €  hhx 


ftn-Q 

R  12 

|  ii 

2 

0 

S3 

u 

ft  21 

R  22 

,  -  1 1 

♦ 

2 

ft  21 

R  22 

(4-9) 


A  \. 
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Q.,t  =  P  Ht(R  n)  +  —  R  n(<x).  (4-10) 

Equations  4-9  and  4-10  follow  immediately  from  the  Hilbert  space  structure  of  l2  and  the  fact  that  Ru 

is  completely  unstable.  In  most  H2  problems,  Rn  is  strictly  proper,  so  Qlfi  «■  0  and  K,fl  ■■  K n. 


The  case  of  a  **  oo  (//^-optimization)  is  substantially  more  complicated  and  the  corresponding 
optimization  problem  in  Eq.  4-6  is  refered  to  in  this  report  as  the  "general  distance  problem"  (GDP). 
This  terminology  arises  from  viewing  the  optimization  in  4-6  as  follows: 


Given  R 


R  n  R  12 
R  21  R  22 


€  L  a,  ,  find  the  optimal  Q  €  R such  that 


1, 


min 


Rn-Q  R 

Rn  ° 


12 


1  22 


(4-11) 


is  minimized.  Note  that  the  minimum  norm  is  the  distance 


-7,  —  ditt  (  R  , 


//.  0 

0  0 


(4-12) 


from  R  to  the  set  of  (matrix)  functions  of  the  form 


Q  0 
0  0 


,  Q  €  Ha 


(4-13) 


This  class  of  problems  will  be  called  the  "4-block  problem"  in  this  report  to  distinguish  from  the 


special  case  where  £/?2,  22  J  or 


R 12 
R  22 


is  identically  zero.  The  latter  will  be  referred  as  the  "1-block 


problem".  If  both  /?aj  and  ^ 


are  zero,  this  is  known  as  the  "best  (or  Hankel)  approxima¬ 


tion"  problem  (eg.  Adamjan,  Arov,  and  Krein  (1971,1978),  Sarason  (1967),  Glover  (1984))  since 


11  Ril  ~  Q  'I®  “  11  (4-14) 

This  problem  has  the  special  property  that  it  reduces  to  a  finite  dimensional  eigenvalue  problem  and  solu¬ 
tions  can  be  computed  with  standard  matrix  algebra  routines.  Note  that,  in  general,  a  Q,pl  which 
achieves  the  minimum  in  4-14  does  not  minimize  the  GDP  in  4-11,  although  it  will  be  shown  to  provide  3 


reasonable  approximation  to  the  actual  minimizer. 


1.5  p  For  Complex  Perturbations 


In  the  previous  sections,  it  was  shown  that  robust  performance  and  stability  with  structured  uncertainty 
reduces  to  computing  p  for  constant  matrices  G(jv)  and  then  taking  sup  over  all  w.  For  this  to  be  useful, 
we  must  have  ways  of  computing  p  or  bounds  for  it.  This  section  will  begin  by  outlining  some  of  the 
mathematical  properties  of  p  for  complex  perturbations  and  viewing  it  as  a  natural  generalization  of  the 
spectral  radius  p,  and  the  spectral  norm  (maximum  singular  value)  9  .  The  rest  of  this  section  will  focus  on 
using  scalings  to  characterize  p  in  terms  of  p  and  9. 


Suppose  that  A  is  some  subalgebra  of  matrices  satisfying 

(A/|A  €  C}  C  A  C  C**".  (5.1) 


In  this  report  we  will  be  interested  in  block  diagonal  A.  Define  the  spectrum,  «p(A/),  and  inverse  spectrum, 
i*p(A/),  of  a  matrix  M  €  C'Vx,v  with  reiptct  to  the  nbatgebn  A  as 


«p(A/)  =  j  A  €  A 
itp(M )  -  (a  €  A 


det(M  -  A)  =0j 

det(/  -  A/A)  =  0  J  .  (5.2) 


Since  both  sets  depend  on  A  it  would  be  appropriate  to  subscript  the  symbols,  but  to  keep  notation  simple 
this  will  be  avoided  throughout.  The  set  ip(M)  is  a  natural  generalization  of  the  usual  notion  of  spectrum 
and  is  always  nonempty.  In  this  context,  p  can  be  viewed  as  a  natural  generalization  of  spectral  radius  since 
it  b  easily  verified  that 

p(M)  =  sup  z(A).  (5.3) 

A6  »t{M) 

If  p(M )  jt  0  (which  b  equivalent  to  i»p(M)  /  0)  then 


p(M)  =  sup  —r. 


(5.4) 


This  characterization  emphasizes  the  view  of  p  as  a  generalization  of  9  and  b  simply  a  restatement  of  (3.4). 
Indeed,  in  the  special  cases  where  A  b  equal  to  one  of  its  possible  extreme  sets  in  (5.1),  p  b  exactly  either 
the  usual  spectral  radius  or  maximum  singular  value: 


A  =»  { A/JA  €  C)  »  p(M)  =  p(M) 
A=*CNkN  =>  p(M)=9(M) 


(5.5) 
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It  is  possible  to  use  these  two  special  cases  to  obtain  bounds  for  p.  For  any  set  A  it  easy  to  see  that 


p(M)  <  n(M)  <  0  (AS) 


(5.6) 


but  these  bounds  are  not  directly  useful  for  computation  as  matrices  may  be  found  that  make  the  differences 
between  the  bounds  and  p  as  large  as  desired. 

It  is  possible  to  improve  the  bounds  in  (5.6)  by  using  simple  properties  of  A.  Suppose  that  U  and  D 
are  sets  such  that  for  any  A  €  A 

UeU  =»  *(17A)  =  *(A) 

(5.7) 

De  D  =>  D~lAD  =  A. 

Then  it  is  easy  to  see  from  the  definition  of  p  that 

UeU  m  p(MU)  =  p(M) 

(5.8) 

DeD  =>  p(DMD~l)=p(M) 

so  the  bounds  in  (5.6)  can  be  improved  to 

sup  p{MV)  <  p(M)  <  inf  9  (DMD~l) .  (5.9) 

The  key  theorems  about  p  show  when  these  inequalities  are  actually  equalities. 


Let  us  first  consider  the  case  where  all  the  blocks  are  complex  and  none  are  repeated.  Then  we  have 
the  sets 


A  =  jdiag  (Ai,  A, . A,)  | 

U  =  (diag^.V, . Un) 

D  =  jdiag(d|/,<fj/,. . .  ,</*/) 


|  A }  6 

v;u,  =  ij 
di  e  R+j 


(5.10) 


It  is  easy  to  verify  that  (5.7)  holds  so  that  the  inequalities  in  (5.9)  apply.  What  is  more  important  is  that 


holds  for  all  AS  and  A  and 


sup  p(MU)  =  jj(AS) 
v&i 

„(A/)=  inf  *  (DM D~l) 


(5.11) 


(5.12) 


V  •  .  •  .T,  jy>  V.-V,  „ 


if  n  <  3  (  three  or  fewer  blocks  )  [D 13].  There  are  other  conditions  under  which  this  upper  bound  is  an 
equality  but  they  are  more  cumbersome  to  state  and  generally  of  less  interest.  An  example  of  strict  inequality 
for  the  upper  bound  has  been  found  for  n  =  4.  Extensive  computational  experimentation  has  yet  to  find  a 
matrix  for  which  the  upper  bound  exceeds  p  (actually  some  lower  bound  for  p)  by  more  than  15%,  and  the 
upper  bound  is  nearly  equal  for  most  matrices.  This  seems  to  be  independent  of  matrix  size  and  number  of 
blocks.  This  is  encouraging  but  additional  theoretical  work  is  needed  to  guarantee  the  quality  of  the  upper 
bound  in  general. 

The  case  of  repeated  blocks  is  less  well  understood.  To  see  what  U  and  D  arise  when  there  are  repeated 
blocks  consider  the  simple  case  where  each  block  is  a  repeated  scalar 


▲  «  jdiag («,W . «./)  «>€CJ 

U  =  jdiag («[/,«,/, ...,u„/)  Uj  €  C,|u,|  =  1  j 
D  =  jdiag  (Dt,Z?Jt.  ...A,)  Z>,  invertible  j 


It  is  possible  to  restrict  the  D  €  D  to  positive  definite  Hermitian  matrices  (D,  =  D*  >  0)  without  loss  of 
generality.  As  above,  the  inequalities  in  (5.9)  hold  and  (5.11)  also  holds  for  all  M  and  A.  Unfortunately,  the 
conditions  under  which  the  upper  bound  is  an  equality  are  not  easily  checked.  The  computational  experience 
with  the  case  of  repeated  blocks  is  much  more  limited  than  with  nonrepeated  blocks,  but  the  evidence  so  far 
suggests  that  the  upper  bound  is  also  nearly  an  equality.  The  case  of  repeated  nonscalar  blocks  is  just  the 
obvious  combination  of  the  the  above  two  cases. 

The  lower  bounds  in  terms  of  p(MU )  have  the  desirable  property  of  always  achieving  p  independent 
of  the  number  of  blocks.  Unfortunately,  p(MU)  can  have  multiple  local  maxima  which  are  not  global  so 
direct  computation  of  (5.11)  by  gradient  search  may  not  find  the  actual  maximum.  At  this  time  there  is  no 
alternative  scheme  guaranteed  to  find  the  global  maximum  that  has  reasonable  computational  properties. 
Fan  and  Tits  (1985)  do  have  an  alternative  scheme  for  a  lower  bound  which  does  not  guarantee  that  p  will 
be  found  but  appears  to  be  very  fast  and  has  many  advantages  over  using  (5.11). 

The  upper  bound  in  (5.9)  is  more  easily  found  since  the  expression  9  ( DMD~l )  has  only  global  minima. 
This  is  a  direct  consequence  of  the  fact  that  9  ( eDMe~D )  is  convex  in  D.  This  fact  was  used  in  [D13]  to 


argue  that  the  upper  bound  in  (5.9)  probably  offered  a  reasonable  alternative  to  (5.11)  (or  computation  of 
H-  The  original  proof  of  convexity  was  rather  cumbersome  and  appeared  later  in  [S5j.  A  much  simpler  proof 
is  included  at  the  end  of  this  section. 

Computational  experience  to  date  has  indicated  that  it  is  desirable  in  practice  to  use  both  upper  and 
lower  bounds  for  fi,  since  the  existing  bounds  nicely  complement  each  other.  The  upper  bound  is  easily 
computed  but  may  not  give  p  except  in  special  cases.  On  the  other  hand,  it  appears  to  be  nearly  equal  to 
ft  in  all  cases.  The  existing  lower  bounds  (including  both  (5.11)  and  those  Fan  and  Tits)  are,  in  principle, 
equal  to  /i  in  all  cases  but  may  fail  because  of  local  maxima.  By  having  an  upper  bound  it  is  much  easier  to 
recognize  when  a  local  maxima  is  not  global  and  restart  the  algorithm  with  another  initial  guess.  Extensive 
computational  experience  has  yet  to  reveal  a  (complex)  n  problem  where  the  bounds  obtained  in  this  way 
differed  by  more  than  about  15%.  More  research  is  needed  to  show  whether  this  is  always  true.  It  could 
simply  turn  out  that  counterexamples  exist  but  are  difficult  to  find. 


Proof  of  convexity  of  9  (e°Me~D) 


This  proof  is  based  on  the  following  simple  lemma.  Suppose  that  /  :  R  >->  R  is  continuous. 


Lemma.  Suppose  V*,  3 p,  :  R  •-»  R  twice  differentiable  such  that  /(x)  =  p,(x),  /(/)  >  p,(l)  and 
a^rt7*(f )|(_  >  0-  Then  /  is  convex 


To  prove  convexity  of  9  (e°Me~D)  it  suffices  to  prove  convexity  of  /(x)  =  9  ( eDtMe~D *)  for  arbitrary 
D  —  Dm  €  C***.  To  apply  the  lemma  define  Mm  =*  tD*Me~D*  and  let  u  and  v  be  (any)  singular  vectors 
such  that  /(x)  =  9[Mt )  =  u*A/,ti.  Then  define  p*(f)  —  S  {u*eDtMe~Dtv}.  Since  /(f)  >  p,(f)  and 


i=i 


-  »  {u*  (U*M,  -  2DM,D  +  MtD)  v} 

=  /(zHu’U’u  +  v'D'v)  -  2S  {u*DM,Dv) 


=  (u*D 


>  0 


by  the  lemma  /  is  convex 


1.6  Computation  of  ft  for  Real  Perturbation* 


The  properties  of  ft  when  A  has  some  elements  restricted  to  be  real  are  quite  different  from  the  purely 
complex  case.  Suppose  that 

A  =  {diaj  A,.  As . A,) 

*  6R.A,  (6.1) 

and  sp(Af)  and  i#p(Af)  are  defined  (or  A  exactly  as  in  (5.2).  In  this  case,  it  is  possible  for  either  *p{M)  or 
i*p( Af)  to  be  the  empty  set.  Furthermore,  (5.3)  is  no  longer  a  correct  characterization  of  ft  in  general  and 
there  is  no  natural  way  to  view  p  as  a  simple  generalization  of  the  usual  notion  of  spectral  radius.  Of  course, 
(5.4)  still  applies  provided  ft(M)  /  0  (i.e.  i*p(A/)  jt  0). 

This  section  will  focus  on  upper  bounds  to  ft  that  can  be  obtained  by  scaling  9.  The  choice  of  scaling  is 
based  on  the  following  lemma  which  characterizes  a  useful  class  of  scalings.  In  the  following  lemma,  assume 
that 

„  r»l 

T  = 

.Tji  Tjj 

and  det(f  -  TjjM)  jt  0  so  that 

f,(t ,  m  )  =  r„  +  r„  M(i  -  rjaA/)-*  r„ 


is  well-defined. 


Lemma 


Suppose  3T  such  that  BA  C  |f»(T,  A)  <r(A)  <  l| 


then  ff(F,(r,Af))  <  1  =>  ft(M)  <  1 


This  lemma  says  that  if  F,(T,  A)  ’‘covers"  BA  then  T  can  be  used  to  obtain  an  upper  bound  for  ft. 
The  proof  is  given  at  the  end  of  this  section.  The  next  step  is  to  identify  a  set  of  T's  that  satisfy  the  lemma. 
To  this  end  define 


V  —  (diag(d|,dj,. . .  ,d«,dm+|/,  dm+J/, . . .  ,dm+»  J) 
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|d,  6  R+J 

C  — -  jdiag  (c|tC)f»fCfn,010t..M0j  €  [  — 1>  1]  ^ 

T={[i-,  D€0,C€cj  (8.2) 

where  D  and  C  are  partitioned  conformally  with  A.  With  these  definitions,  for  T  €  T 

Fi(T, M)  =  jC  +  (I  -  C*) l/i  DMD~l.  (6.3)“ 

It  is  a  matter  of  some  simple  algebra  to  show  that  ail  T  €  T  satisfy  the  lemma.  Note  that  if  there  are  no 
real  parameters  (m=0),  then  Fi(T,M)  -  DA/D-1  and  this  scaling  reduces  to  that  considered  in  (5.10)  and 
(5.12).  Other  T  also  satisfy  the  lemma  but  this  parametrization  is  convenient  because  (6.3)  is  particularly 
simple. 

Obtaining  an  upper  bound  based  on  (6.3)  is  somewhat  more  complicated  than  is  possible  in  (5.12).  The 
difficulty  is  that  the  above  lemma  only  implies  that  ji(A/)  <  1  and  does  not  scale.  Using  the  sets  in  (6.2), 
we  can  define 

*(A/)«oinf+  {o|  mf.^F((r,iA/))<l}.  (6.4) 

It  follows  immediately  from  the  lemma  that 

<  ji(M) 

and  thus  ji  provides  am  upper  bound  for  /i.  Again  note  that  for  no  real  parameters,  ji(\i)  simplifies  to 

it(M)  »  M^DA/D-') 

The  natural  question  is  how  good  a  bound  is  ji  for  p.  Recall  that  for  m  =  0,  n  <  3  that  ji  =  jj  for 
all  matrices  independent  of  block  size  (m  is  the  number  of  real  parameters  and  n  is  the  number  of  complex 
blocks).  A  simple  extension  of  this  result  yields  ji  =  p  when  m  =  1,  n  <  2.  Although  counterexamples 
exist  for  problems  with  more  than  these  number  of  blocks,  experience  has  shown  that  ji  is  often  a  good 
approximation  to  jj  even  in  these  cases.  While  this  experience  is  encouraging  it  is  not  conclusive  and 
additional  research  is  needed  to  establish  the  value  of  ji.  Unfortunately,  when  there  is  more  than  one  real 
parameter  it  is  possible  for  /i(A/)  <  £(A/)  . 


Proof  of  lemnuu  Since 


det(/  -  TjjA/jdet  [I  -  F,(T,M) A) 

=  de<(/-  r„d)det  (/-  A//;(r,d)), 


<r(F,(r,A/))  <  1  =>det(/-rf(r,A/)A)  ?t0  Vff(A)  <  1 
=*  det(7  -  M  Ft(T,  A))  #  0  V*(A)  <  1 
=*  det(7  -  AM)  jt  0  VA  €  B  A 
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1.7  /J-Synthesta 

The  previous  sections  on  analysis  showed  that  the  synthesis  problem  reduces  to  finding  a  stabilizing 
controller  K  so  that 

||Fi(P, ifJIja  <  1  o  =  oo  or  n  (7.1) 

where  Fi(P,  K)  =  Pu  +  Pt *  K(I  —  Pjjfif)-1  Pjj.  The  solution  to  (7.1)  for  or  —  oo  was  outlined  in  section 
4  and  additional  details  are  presented  in  the  remaining  chapters  of  this  report.  This  JToo-synthesis  solution 
can  be  used  to  provide  an  approach  to  solving  the  /i- norm  synthesis  problem,  refered  to  as  /j-syn  thesis. 

Recall  that  the  first  step  in  the  Ha 0  synthesis  solution  involves  finding  J  so  that  the  substitution 
K  =  F,(J,Q)  yields 

F,(P,K)  =  Ft(P,Fi[J,Q))  =  R  +  UQV  (7.2) 

with  Ft{P,  K)  internally  stable  iff  Q  €  Hoo.  Further,  U  is  inner  and  V  co-inner  ( U*U  =  I  and  VV*  =  /), 
and  there  exist  complementary  inner  factors  U±  and  Vj.  such  that  \UUx]  and  ]  are  both  square  and 
inner.  The  U  and  V  are  obtained  from  coprime  factorizations  Pu  =  UM ,_1  and  P2\  —  M2lV.  The  next 
step  involves  using  a  rational  matrix  version  of  the  Davis-Kahan* Weinberger  matrix  dilation  results  [Dl|  to 
further  reduce  the  problem  to  one  of  finding  Q  €  RHoo  such  that 

!|G  +  <?lloo<l  (7.3) 

where  G  €  P£oo-  This  problem  can  then  be  solved  using  the  Hankel  norm  approximation  methods 
developed  by  Glover  [G2|.  The  resulting  optimal  Q  can  then  be  used  to  find  first  the  optimal  Q  and  then 
the  optimal  K. 

The  p-synthesis  problem  does  not  yet  have  as  complete  a  solution  as  does  the  H<x  synthesis  problem. 
A  reasonable  approach  would  be  to  try  to  find  a  stabilizing  controller  K  and  scaling  D  so  that 

||DP((P,jn£>-M|co<l.  (7.4) 

One  method  to  do  this  is  to  alternately  minimize  the  above  expression  for  either  K  and  D  while  holding  the 
other  constant.  For  fixed  D  the  left-hand  side  of  (7.4)  is  just  an  Hoc  control  problem  and  can  be  solved  using 
the  methods  reviewed  above.  For  fixed  K,  the  left-hand  side  of  (7.4)  can  be  minimized  at  each  frequency 


as  a  convex  optimization  problem  in  D.  The  resulting  D  can  be  fit  with  a  stable,  rational  transfer  function 
with  stable  inverse  (the  phase  of  D  does  not  affect  the  norm). 

This  approach  to  /i-synthesis  has  been  successfully  applied  to  several  example  problems.  In  principle,  it 
could  be  used  to  obtain  controllers  that  are  arbitrarily  close  to  (t- optimal  in  the  case  of  3  or  fewer  blocks  and 
provide  nearly  optimal  controllers  for  the  general  case.  This  would  depend  on  the  suggested  iterative  scheme 
converging  to  the  global  optimal  K  and  D.  Unfortunately,  individual  convexity  in  the  two  parameters  of  an 
optimization  problem  does  not  imply  joint  convexity,  and  this  scheme  is  not  always  guaranteed  to  converge 
globally  to  the  best  K  and  D. 

To  better  understand  the  properties  of  the  problem  in  (7.4)  it  is  useful  to  consider  the  constant  matrix 
problem.  Using  (7.2),  we  can  reduce  (7.2)  to 

\\D(R  +UQV)D~'\\X  <  1.  (7.5) 

for  constant  R,  U,  V  with  U'U  =  1  and  VV*  =  1.  For  D  =  I,  it  follows  from  |G2]  that 

min  if(*  +  t/QV)  =  max (<r(U;K),  <f(KV;))  (7.6) 

where  U±  and  are  chosen  so  that  \U  UJ  “<1  (yx  J  are  both  square  and  unitary.  All  of  these  quantities 
are  easily  computed  using  standard  SVD  routines. 

Posing  (7.5)  as  an  optimization  problems  gives 

mm  9(D(R  +  VQV)D-').  (7.7) 

It  is  known  that  this  problem  is  convex  in  either  D  (actually  ln(D))  or  Q  individually  when  the  other  is 
held  fixed,  but  is  not  convex  in  both  variables  jointly.  This  means  that  the  iterative  scheme  suggested  as 
a  possible  approach  to  ^synthesis  is  not  guaranteed  to  converge  even  in  the  constant  matrix  case.  It  is 
possible,  however,  to  compute  the  desired  D  in  (7.7)  directly. 

The  result  in  (7.6)  may  be  applied  to  (7.8)  to  obtain 

nun  9  (D(R  +  UQV)D~l)  = 


m»x{9((DU)\DRD-1)  ,  V  (£>/?D-,(V'D-');)} 


(78) 


ikJklhUl 


(DU)±  =  D-lUxiUiDr*V±)-1/*  (7.9) 

and  (VD~l)±  is  defined  similarly.  Note  that  [DU{U* D7U)~lfi  (DU)X\  is  unitary.  It  can  be  shown  that 
the  right  hand  side  of  (7.S)  is  convex  in  ln(D)  so  that  the  ‘‘optimal”  scaling  for  (7.7)  may  be  computed 
by  search  in  advance.  This  gives  a  tight  lower  bound  for  (7.7)  and  the  resulting  D  scaling  may  be  used  to 
compute  the  optimal  Q. 

A  simple  example  will  illustrate  all  the  essential  features  of  this  possibly  confusing  sequence  of  ideas. 


Consider  the  problem 


t  "([ V  I]  +  [5]  °i) 

■sy'fl./i  ?])■ 


The  ^-optimal  q  is  q  s  0  which  gives  ft  =  1.  For  fixed  d  the  7-optimal  q  =  <P  and  for  fixed  q  >  0  the 
7-optimal  d  is  d  =  ^ q .  Thus,  iteratively  solving  for  either  q  or  d  will  immediately  converge  to  the  curve 
q  =  <P.  For  example,  with  the  initial  guess  of  q  =  d  =  1,  the  iterative  scheme  will  not  change  either  q  or  d 
and  will  thus  fail  to  find  the  global  optimum. 


On  the  other  hand, 


“n  9  [,“/d  J]  =  max  (»([-!  d])  ,  »(|j])) 


=  yl  +  dP. 


nia  (nun  if  [  *])  =  min 


which  is  clearly  convex  in  both  d  and  Irtd  and  achieves  its  minimum  as  d  —  0.  U  the  expression  in  (7.8)  were 
used  to  compute  the  d  in  advance,  it  would  be  possible  to  find  the  optimal  achievable  level  for  (7.9).  This 
example  also  illustrates  why,  strictly  speaking,  inf,  not  min  must  be  used  for  the  D  scalings  as  in  (5.9).  This 
issue  will  not  be  taken  up  in  this  report.  It  turns  out  to  be  of  little  significance  anyway. 

The  simplest  application  of  these  ideas  to  the  selection  of  the  D  scalings  for  the  ^-synthesis  problem  is 
to  compute  an  initial  guess  for  D  at  each  frequency  using  (7.8).  This  would  be  the  optimal  D  for  an  acausal 
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controller,  and  should  provide  a  good  initial  guess  for  the  optimal  D  for  the  causal  controller  problem.  A 
deeper  question  is  whether  some  generalization  of  (7.8)  and  its  convexity  properties  applies  to  the  rational 
case.  While  this  seems  likely,  the  details  have  not  been  worked  out  and  the  practical  implications  are 
uncertain.  For  some  additional  results  on  /i-syn thesis,  see  [SS]. 
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1.8  Outline  of  Chapter!  2*8 

The  remainder  of  this  report  considers  various  issues  arising  in  H *  optimal  control  theory  which  are 
associated  with  a  particular  solution  approach  which  involves  reduction  of  the  standard  problem  to  a 
"general  distance  problem”  (GDP)  in  L «*.  Since  H <*,  methods  were  introduced  to  the  engineering  com¬ 
munity  by  Helton  (  1981),  Tannenbaum  (1980),  and  Zames(1981),  there  have  been  numerous  papers  on  the 
subject  from  many  points  of  view  (e.g.  Chang  and  Pearson  (1984),  Feintuch  and  Francis  (1984),  Foo  and 
Posthlethwaite  (1984),  Francis  and  Zames  (1984),  Glover  (1984),  Khargonekar  and  Tannenbaum  (1985), 
Kwakernaak  (1983),  Safonov  and  Verma  (1983),  Verma  and  Jonkheere  (1984),  to  name  just  a  few).  This 
paper  will  focus  on  the  approach  developed  in  Doyle  (1983,  1984),  Chu  and  Doyle  (1984,  1985),  and  Chu 
(1985),  which  was  outlined  in  Section  1.4.  For  an  overview  of  //*  control  theory  and  a  review  of  the 
literature  see  the  expository  paper  by  Francis  and  Doyle  (1985). 

This  report  is  divided  into  eight  chapters.  Chapter  2  expands  on  the  overview  in  Section  1.4  of  the 
general  optimal  synthesis  theory  from  Doyle  (1984),  which  includes  both  the  Hz  and  H „  optimal  control 
problems.  The  affine  parametrization  of  the  closed-loop  transfer  matrix  is  obtained  following  Youla's 
parametrization  of  all  controllers  achieving  internal  stability.  A  particular  parametrization  is  employed 
involving  coprime  factorization  with  inner  numerator;  the  //^-optimal  controller  is  found  immediately  and 
the  //^-optimization  formulation  is  transformed  to  an  equivalent  "general  distance  problem"  (GDP). 
Two  simple  examples  are  also  presented. 

Chapter  3  describes  results  on  optimal  solutions  to  the  general  distance  problem.  The  optimal  norm 
of  the  GDP  can  be  expressed  in  terms  of  an  induced  operator  norm  or  an  equivalent  eigenvalue  problem 
involving  a  combination  of  Hankei  and  Toeplitz  operators.  The  approach  is  conceptually  elegant;  how¬ 
ever,  it  does  not  yield  a  computable  formula  for  either  the  minimal  norm  or  the  optimal  solution. 

In  Section  4,  the  approach  of  7-iteration  is  introduced.  It  essentially  involves  guessing  a  7  and  then 
reducing  the  problem  to  an  equivalent  best  approximation  formulation.  The  guess  for  7  is  iterated  until  it 
converges  to  the  minimal  norm,  and  the  optimal  solution  is  thus  obtained.  Some  fairly  tight  bounds  for 
the  minimal  norm  which  are  easily  computed  are  also  given;  these  immediately  allow  for  reasonable  esti¬ 
mates  of  the  minimal  norm  as  well  as  giving  an  approximation  technique  for  obtaining  suboptimal  solu- 


/Vv.v., 
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tions  that  are  within  a  guaranteed  bound  of  optimal.  To  study  the  convergence  properties  of  the  7- 
iteration  it  is  then  viewed  as  a  problem  of  finding  the  zero  crossing  of  a  function.  It  is  established  that 
this  function  is  continuous,  monotonically  decreasing,  convex,  and  in  turn  bounded  by  some  very  simple 
functions.  These  properties  make  it  possible  to  obtain  very  rapid  convergence  of  the  7-iteration.  An 
interesting  example  is  given  to  illustrate  some  important  aspects  of  the  general  distance  problem  which 
were  not  previously  well-understood. 

The  state-space  formulation  of  the  7-iteration  is  then  developed  and  presented  in  Chapter  5.  In  the 
7-iteration,  a  key  step  is  to  find  the  spectral  factor  of  a  para-Hermitian  matrix  of  the  form  (~i2I  -  G*G ) 
(or  (72/  -  GG*))  which  typically,  requires  one  to  solve  for  a  coprime  factorization  with  inner  denominator 
and  a  standard  spectral  factorization.  Each  of  the  factorizations  requires  finding  the  stabilizing  solution  of 
an  algebraic  Riccati  equation  (ARE).  The  ARE  associated  with  coprime  factorization  has  a  special  struc¬ 
ture  where  the  constant  term  is  identically  zero.  A  very  efficient  algorithm  based  on  a  Schur  decomposi¬ 
tion  is  developed  to  solve  the  ARE  with  this  special  structure.  For  completeness,  balanced  realizations 
and  Glover's  algorithm  to  the  best  approximation  problem  are  also  reviewed.  Combining  previous  results, 
the  "closed-form"  state-space  optimal  solutioo  of  the  general  distance  problem  is  then  obtained. 

Chapter  6  recapitulates  the  results  detailed  in  Chapter  2  through  Chapter  5,  with  a  discussion  of 
some  numerical  aspects  of  the  algorithms.  Chapter  6  also  discusses  some  results  on  model  reduction  in  the 
context  of  the  GDP  as  a  method  for  obtaining  suboptimal  solutions  with  reduced  order.  The  focus  here 
is  on  model  reduction  techniques  where  the  error  produced  by  a  reduction  can  be  related  directly  to  tbe 
degree  of  suboptimality  of  tbe  resulting  solution. 


CHAPTER  2 

OPTIMAL  SYNTHESIS  THEORY 
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This  chapter  contains  a  review  of  the  general  H2  and  //„  optimal  synthesis  theory.  It  is  an  expan¬ 
sion  of  the  material  outlined  in  Section  4  of  Chapter  1  and  shows  in  detail  how  the  general  H &  problem 
reduces  to  the  “General  Distance  Problem".  Recall  that  the  framework  used  here  is  depicted  in  Figure  2- 
1.  The  approach  used  in  this  chapter  follows  closely  that  in  [D16|. 


Figure  2-1.  General  Framework  for  Synthesis 


Partitioning  P  appropriately,  the  closed-loop  transfer  function  matrix  from  v  to  e  can  be  writtened  as 
the  following  LFT 

F,(P,K)  =  Pn  +  Pl2K(I-PaK)-'P2l  (2-1) 


where 


re 


•  ■. 
K;.; 


•'.^Vv;v,v,v,v;v.v,vvv 
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P  11  P 12 
P  21  ^22 

The  problem  in  H2  and  H v  synthesis  is  to  find  s  controller  K  achieving  ’’internal  stability*  such 
that  the  performance  measure  1 1  F(  (P  ,K  )  |  |a  is  minimized  for  a  —  2  or  oo. 

In  this  chapter,  the  notion  of  internal  stability  for  the  LFT  Tt(P  ,K)  is  reviewed  first  in  Section  2.1. 
Section  2.2  gives  an  algebraic  treatment  of  the  Youla  parametrization  of  all  stabilizing  controllers  in  terms 
of  a  stable  parameter  matrix  Q  6  RH^  with  the  LFT  representation, 


€  R 


(fx  *  Pjxf.Mt*  ■*) 


p„  e  R 


p,  X. 


(2-3) 


a  «*  ) . 

This  parametrization  has  the  additional  property  that  substitution  of  Eq.  2-3  into  Eq.  2-1  yields 


K.V 

<wV 


F ,(P.K)  -  TtiP.TtiJ.Q)) 

-  r,(T.Q) 

where 

F,(r,<?)  -  r„  +  TnQTa  (2-4) 

providing  an  affine  parametrization  of  all  internally  stable  closed-loop  transfer  function  matrices 

F,(P,AT).  The  Youla  parameterization  is  then  constructed  using  the  standard  state-space  computations 

of  observer-based  stabilization  methods,  providing  explicit  realizations  of  the  desired  J  in  Eq.  2-3  in  terms 

of  a  realization  of  P .  The  affine  parameterization  of  the  closed-loop  system  is  also  derived  using  a  state- 

space  realization. 

In  Section  2.3,  a  particular  parametrization  is  derived  such  that  T 12  is  inner  and  T2i  is  co-mner. 
The  algebraic  Riccati  equation  plays  an  essential  role  in  obtaining  such  a  parametrization.  Using  the 
unitary-invariant  property  of  the  Hr  and  //^-norm,  the  tfj'OPtiroal  controller  is  immediately  obtained. 
The  H oo  optimal  control  formulation  is  transformed  into  an  equivalent  "general  distance  problem"  which 
will  be  discussed  in  great  detail  in  the  next  three  chapters.  Section  2.4  presents  two  examples  to  illustrate 
the  H 2  and  H optimal  control  respectively. 


2.1  Internal  Stability 


The  results  in  this  section  are  entirely  standard,  although  typically  not  expressed  in  the  LFT  frame¬ 
work.  For  an  interesting  alternative  treatment  which  is  closely  related  see  Nett  (1986).  In  this  section  P 
and  K  are  fixed  proper  transfer  function  matrices.  The  block  diagram  associated  with  Figure  2-1 
represents  the  two  equations 


u  —  Ky  . 


It  is  convenient  to  introduce  two  fictitious  external  signals,  and  wit  as  in  Figure  2-2. 


y 

w 


e 


w 

2 


Figure  2-2. 


Suppose  the  signals  v.u;,,  and  are  specified  and  that  u  in  Figure  2-2  is  well-defined.  Then  so  are 
e  and  y .  Thus  it  makes  sense  to  define  the  system  as  diagrammed  in  Figure  2-2  to  be  "well-posed"  pro¬ 
vided  the  transfer  function  matrix  from  [ur  w[  w \  ]  to  u  exists  and  is  a  proper  one.  The  following 


theorem  shows  the  necessary  and  sufficient  conditions  of  well-posedness. 


Theorem  1 


The  following  statements  are  equivalent: 


(i)  The  system  as  diagrammed  in  Figure  2*1  is  well-posed. 


(i.)  / 

-  K  (oo)/7 23(00) 

is  invertible. 

(2-5) 

(Hi) 

/  -UT(oo)' 

-Pa(oo)  / 

is  invertible. 

(2-6) 

(iv)  . 

1  -  P^<x>)K  (00) 

is  invertible  . 

(2-7) 

x  “  Ax  +  B2u 


( 2-  12a) 


y  *»  C2*  +  D  2 ju 
*  ■  +  B  y 


(2-12b) 
(2-1 2c ) 


«  —  C*  +  By.  (2-1 2d ) 

The  system  diagrammed  in  Figure  2-1  is  "internally  stable’  provided  the  null  solution  (j,x)  **  (0,0)  of 

Gqs.  2-12a  to  2-12d  is  asymptotically  stable  To  get  a  concrete  characterization  of  internal  stability,  solve 


Gqs.  2- 12b  and  2-12d  for  u  and  y: 


•  vi  r  - 

/  -DOC  x 
D  22  I  C  2  o  X 


(Note  that  the  inverse  exists  because  of  the  well-posed  ness  condition  Eq.  2-11).  Now  substitute  this 
expression  for  u  and  y  into  Eqs.  2-12a  and  2-12c  to  get 


J_  2  .  i  2 

it  2  Z 


0  B2  0  /  _Z)  l'1  0  C 

4  J  +  [o  B  J  -Da  /  C2  0 


Thus  internal  stability  is  equivalent  to  the  condition  that  A  is  a  stability  matrix,  i.e.,  all  eigenvalues  of  A 
lie  in  the  open  Ihp  . 

It  is  not  difficult  to  verify  that  the  above  definition  of  internal  stability  depends  only  on  the  transfer 
function  matrices  P  and  K ,  and  not  on  the  specific  minimal  realizations  of  them.  The  following  result  is 
standard. 


Theorem  S 


Consider  a  minimal  realization  of  the  system  P  as  in  Eq.  2-8.  Then  there  exists  a  proper  real- 


rational  transfer  function  K  achieving  internal  stability  if  and  only  if  the  pair  (^4  ,B2)  is  stabilizable  and 
the  pair  (C2,A  )  is  detectable.  The  latter  stabilizability  and  detectability  conditions  are  assumed 
throughout  this  chapter. 


Eqs.  2-12a  through  2-12d  constitute  a  state-space  representation  of  the  system  diagrammed  in  Figure  2-3. 
Although  the  realization  in  Eq.  2-13  is  not  necessarily  minimal,  it  is  stabilizable  and  detectable,  and  these 
are  enough  to  yield  the  following  result. 


Figure  2-3. 


Theorem  3 

The  system  diagrammed  in  Figure  2-1  is  internally  stable  if  and  only  if  the  system  diagrammed  in 
Figure  2-3  is  internally  stable. 

The  above  notion  of  internal  stability  was  defined  in  terms  of  state-space  realizations  of  P 22  and  K  . 
It  is  also  important  and  useful  to  characterize  internal  stability  from  an  input/output  point  of  view.  For 
this,  consider  the  feedback  system  diagrammed  in  Figure  2-4.  This  system  has  an  input/output  relation- 


It  is  intuitively  clear  that  it  the  system  diagrammed  in  Figure  2-4  is  internally  stable,  then  for  all 
bounded  inputs  (t/j,v2),  the  outputs  (elte2)  are  also  bounded.  This  idea  leads  to  an  input/output  charac¬ 


terization  of  internal  stability. 


Theorem  4 

The  system  diagrammed  in  Figure  2-4  is  internally  stable  if  and  only  if  (1-Pj.K)  is  invertible  and 
the  transfer  function  matrix 


I  -K 

-1 

I+K(l-PaK)'lPa  K  (I  -P~,K  )"' 

rp  a  / 

(1-P-aK  r‘Pj2  [l-PnK]1 

between  (vi,t>2)  and  (e  t,e2)  beloop  to  ft//*. 
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3.3  Parametrlutlon  of  All  Stabilising  Controllers 


For  the  discussion  here,  there  are  two  main  approaches  to  constructing  stabilizing  controllers  for 
linear  systems:  the  Youla  parametrization  and  state-space  methods  using  observers  and  state  feedback 
(K2,L4),  Each  is  well-known  among  the  control  community  and  each  has  its  advantages.  The  Youla 
parametrization  yields  all  stabilizing  controllers  as  well  as  a  convenient  affine  parametrization  of  the 
closed-loop  system.  Unfortunately,  the  standard  algebraic  treatment  of  this  subject  gives  no  reliable 
scheme  to  compute  the  coefficients  of  the  parametrization.  Observer-based  stabilizing  controllers,  on  the 
other  hand,  are  easily  constructed  in  terms  of  a  realization  of  the  transfer  function  matrix  P  using  a 
variety  of  state-space  computation  schemes. 

In  has  been  shown  that  these  two  methods  of  stabilization  are  actually  equivalent.  This  allows  the 
Youla  parametrization  to  be  constructed  using  the  standard  state-space  computations  of  observer-based 
stabilization  methods,  providing  explicit  realizations  of  the  desired  J  in  Eq.  2-3  in  terms  of  a  realization 
of  P .  Combining  the  results  of  the  previous  section  and  this  section,  the  desired  affine  parametrization  of 
the  closed-loop  system  is  then  obtained. 

It  should  be  noted  that  many  of  the  results  on  the  connections  between  the  algebraic  and  observer- 
based  stabilization  methods  were  discovered  independently  by  Nett  and  coauthors  [N2|.  Also,  many  of 
these  results  were  known  within  the  "systems  over  rings"  community  [K4j.  A  complete  treatment  on  the 
equivalence  of  these  two  stabilization  methods  and  the  parametrization  in  terms  of  the  general  framework 
(Figure  2-1)  using  linear  fractional  transformation  was  first  given  by  Doyle  [Dl6j. 

The  following  definitions  of  coprimeness  in  RH&  provide  the  appropriate  framework  for  the 
equivalences  to  be  discussed. 


Definition!  (Right  CoprlmeneM) 

Two  matrices  N ,  M  6  w»th  the  same  number  of  columns  m  are  right  coprime  if  there  exists 


X ,  Y  €  RH *  such  that 


XM  +  YN  =  L 


A 


Definition)  (Left  Coprlmeneu) 

Two  matrices  N ,  M  6  RR  x  with  the  same  Dumber  of  rows  p  are  left  coprime  if  there  exists 

X ,  Y  6  RR ao  *uch  that 

MX  +  NY  -  I,  .  (2-17) 


Remarks 


(1)  Eq.  2-16  (2-17)  is  equivalent  to  saying  that  the  combined  matrix 
inverse  in  /?//<-,. 


f M 1 

N 


([.W  n]) 


has  a  left  (right) 


(2)  An  alternative  definition  for  the  right  (left)  coprime  factorization,  ref  ( lef  ),  is  that  two  matrices  in 
RHm  are  right  (left)  coprime  if  every  common  right  (left)  divisor  in  RRX  is  invertible  in  /?//„. 
This  is  equivalent  to  the  above  definition  in  terms  of  a  left  inverse. 

(3)  Eq.  2-16  (or  Eq.  2-17)  is  often  called  a  Bezout  (or  Diophantine)  identity. 

It  is  a  fact  that  every  G  €  Rr  (proper,  real-rational  transfer  function  matrices)  has  an 
ref  G  NM'X  where  the  pair  N,M  €/?//«>  are  right  coprime.  Similarly,  there  exists  an  lef  , 
defined  in  the  obvious  way  by  duality.  The  proof  of  the  existence  of  such  coprime  factorizations  can  be 
found  in  several  publications  [D16,K4,N2,V3]  with  explicit  realizations  for  the  factorizations.  In  this  sec¬ 
tion,  it  will  be  shown  how  these  factorizations  can  be  used  to  obtain  a  parametrization  of  all  stabilizing 
controllers. 


Beginning  with  ref  s  and  lef  s  of  P &  and  K  in  Figure  2-4: 

PB  -  NM~l  -  MlN  , 

K  -  UV  l  -  V-'O  . 


(2-18) 

(2-19) 


The  following  lemma  is  well-known. 


Lemma  1 
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Consider  the  feedback  system  diagrammed  as  in  Figure  2-4.  The  following  conditions  are 
equivalent: 


(i)  The  feedback  system  as  represented  by  Eqs.  2-12a  to  2-12d  is  internally  stable. 


(ii) 

(iii) 

(iv) 

(v) 


M  U 

N  V 

V 

-  N 


is  invertible  in  RH 


-  0 
M 


is  invertible  in  RH^. 


V  M  -  U  N  is  invertible  in  /?//*,. 
M  V  -  N  V  is  invertible  in  RH 


Explicit  realizations  for  N ,  M ,  N ,  M ,  V, ,  V, ,  U ,,  and  V ,  will  be  given  later  in  this  section 
which  satisfy  Eq.  2-18  and 


V. 

-0,' 

M 

u.‘ 

l  0 

-  N 

M 

N 

• 

v. 

0  I 

which  is  often  also  called  "doubly  Bezout  identity”.  By  the  above  lemma 

K,  *  U,  vv*  -  v;'0. 


(2-20) 


(2-21) 


then  qualifies  as  a  particular  controller  achieving  internal  stability.  The  result  of  the  next  theorem  means 
that  all  stabilizing  controllers  can  be  expressed  in  terms  of  matrix  K,  and  a  parameter  matrix  Q  6  RH 
The  proof  can  be  found  elsewhere,  for  example,  |D5,D16,V2,V3|. 


Theorem  2 

The  set  of  all  proper  controllers  achieving  internal  stability  for  the  feedback  system  (see  Figure  2-1) 
is  parametrized  by  the  formula 


K  -  (U.  +  MQ  )( V,  +  NQ  )"’ 
-  (V.  +  QN  )■'({/ ,  +  QM ) 


(2- 22a) 
(2- 22b) 


V  % 
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-  K,  +  V  ;lQ(I  +  V,-}NQ  (-‘l/,-1 
where  Q  ranges  over  RH *  such  that  (/  +  V,~lNQ  )(oo)  is  invertible. 


(2-22e) 


It  is  not  difficult  to  recognize  that  K ,  as  in  Eq.  2-22 c,  can  be  expressed  in  terms  of  a  LFT  as  shown 
in  Figure  2-5,  i.e. , 


where 


*  -  7,(J,Q) 


k.  vV 

V-1  -  V,-'N 


(2-23) 


Therefore,  it  is  concluded  that  every  stabilizing  controller  can  be  represented  as  a  transfer  function 
matrix  K  «■  T,(J ,Q  ),  as  in  Figure  2-5,  for  some  parameter  matrix  Q  €  RH<x>  (constrained  only  to  be 
stable  and  proper  and  to  make  K  proper).  Using  results  of  the  above  lemma,  the  affine  parameterization 
of  the  closed-loop  transfer  matrix  follows  immediately. 
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Theorem  3 

The  set  of  all  closed-loop  transfer  function  matrices  from  v  to  e  (Figure  2-2)  achievable  by 
nally  stabilizing  proper  controller  as  in  Theorem  2  is 

|  Tu  +  Tl2QTn  :  Q  /  +  DziQ(oo)  invertiblej. 

where 

Tu  P  u  +  P l2Ut  M P 2i 
T  i2  —  P  \2M 

and 

r2i  =”  M  P  a 

( P,t ,  M ,  and  V ,  are  defined  as  in  Eqs.  2-2,  2-18,  and  2-21  respectively). 

Proof 

Substituting  Eq.  2-22a  into  F|  (P  ,K )  in  Eq.  2-1  yields 

F ,{P,K)  -  Pn  +  PM.  +A/Q)(K,+N<?)-1[/  +MQ)(V,  +  *<?)->]  ^ 

-  Pn  +  PM  +MQ)[ti(V.  +NQ)-N[U,  +MQ)YmP21 

Eqs.  2-18  and  2-20  then  yield  M  N  ■»  N  M  and  M  V,  -  N  U,  =  /  .  Therefore, 

Fi(P,K)  -  Pu  +  PM.  +MQ)MPn 

-  (Pn  +  PM.MPn)  +  ( Pl2M)Q{MP2i ) 

=*  T  ii  +  T  i2QT2l 

where  T u,  T  jj,  and  T 21  are  defined  as  in  Eq.  2-24. 


an  inter- 


(2-24a) 

(2-24b) 

(2-24c) 


n 


(2-25) 

QED 


Remark 


Eq.  2-25  is  also  a  linear  fractional  transformation,  Ft(T ,Q  )  with  Ta  =  0. 

In  the  following,  the  coprime  factorizations  will  be  given  in  terms  of  state-space  representations. 
Using  these  formula,  an  explicit  realization  of  one  choice  of  the  interconnection  matrix  J  (Eq.  2-23)  is 
then  derived. 

[a  fll 

For  the  next  two  lemmas,  it  is  assumed  that  G  =*>  ^  p  6  Rf*m  where  the  pair  (A  ,B  )  is  sta- 

>  a 

bilizable  and  the  pair  ( C  ,A  )  is  detectable.  The  following  lemma  shows  that  an  ref  of  G  can  be 
obtained  using  a  stabilizing  state  feedback  gain  matrix  F .  The  existence  of  F  is  guaranteed  since  (A  ,B ) 
is  stabilizable. 


Lemma  4 


A  stabilizing  state  feedback  F  yields  ref  G  —  NM~l  where 


(2-26) 


Note  that  for  any  nonsingular  Z 

A +BF  BZ 

-  F  Z  .  (2-27) 

C+DF  DZ 

is  also  a  realization  of  an  ref  of  G . 

By  duality,  to  get  an  lef  of  G ,  take  H  such  that  A  +HC  is  a  stability  matrix.  Call  such  a  stabil¬ 
izing  matrix  H  an  output  injection  gain. 


Lemma  4' 


A  stabilizing  output  injection  gain  H  yields  lef  G  ™  M~lN  where 
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A+HC 

H  B+HD 

C 

l  D 

(2-28) 


The  next  step  is  to  specify  U, ,  V, ,  U , ,  V ,  to  satisfy  the  doubly  Bezout  identity  in  Eq.  2-20.  The 
idea  behind  the  choice  of  these  matrices  is  as  follows.  Using  observer  theory  [Kai],  find  a  controller  K , 
and  the  associated  matrices  F ,  H  achieving  internal  stability  and  then  perform  factorizations  on  P and 
K,  analogous  to  the  ones  just  performed  on  G  (Eqs.  2-26  and  2-28).  Then  the  result  summarized  as  in 
Lemma  1  implies  that  the  left-hand  side  of  Eq.  2-20  must  be  invertible  in  RH^.  In  addition,  Eq.  2-20  is 
satisfied. 


The  transfer  matrices  N ,  M ,  N ,  and  M  in  Eq.  2-18  have  the  following  realizations, 


M 

N 


A  +5 

Bn 

=» 

F 

I 

C  2  +  D  22  F 

D 

A  +HC  2 

H  B2+HDn 

C, 

l  Dn 

The  realization  equations  for  K „  are 

x  =*  Az  +  Bjii  +  H(C2z  +  D jju  -  ]/) 
u  Fz  , 


that  is, 


A  +B2F+HC2+HD  &F 

-H 

F 

“0  , 

Define 

A  *  A+B2F  +HC 2+HD -bF  , 
C  -  F  , 

F  l=x  C2  +  D  &F  , 


B  -  -H  , 

D  -  0  , 

H  =  -(Bj+HDk)  . 


(2- 29a) 


(2- 29b) 


(2-30) 


Following  Eqs.  2-26  and  2*28,  define 


[v.  0.} 


A  +BF  \B 
F  1 
6  +  DF  D 


A  +H  C\H  B  +  HD 

C  |7  D 


A  +BF 

-H 

C+DF 

I 

.  F 

0  . 

(2-31a) 


A+HC2\-(B2+HDa)  -H 
F  /  0 


(2-3lb) 


Therefore, 


M  U,' 
N  V, 


A  +BjF 

B  2  -H 

F 

I  0 

0  2  +  D  22^ 

Dn  1 

V,  -u, 
-  N  M 


.4  +HC2 

~{B2+HDa)  H 

F 

C z 

/  0 

-Dn  I 

(2-32a) 


(2-32b) 


and  Eq.  2-20  is  satisfied. 

A  realization  of  J  is  now  immediate.  Substitution  of  Eqs.  2-29a,  2-30,  and  2-31  into  Eq.  2-23  leads 
to,  after  simplification, 


A  +B2F +HC  2+HD  &F 

-H  B2+HDn 

F 

-(CJ  +  DaF) 

0  I 

I  ~Da 

Theorem  2  provides  a  parametrization,  in  terms  of  Q ,  of  all  proper  K  s  which  achieve  internal  sta¬ 
bility  in  Figure  2-1.  Substitution  of  the  transfer  function  relationship  of  the  block  diagram  in  Figure  2-5 
into  that  in  Figure  2-1  leads  to  the  one  in  Figure  2-6.  Elimination  of  the  signals  u  and  y  leads  to  the 
transfer  function  relationship  diagrammed  as  Figure  2-7  for  a  suitable  transfer  matrix  T .  Thus  all 
closed-loop  transfer  matrices  are  representable  as  in  Figure  2-7. 


'v  ■  J\.‘‘ /■  V-  ■-*1“-".'-  '-  v  v  v'  ■  ■  ■ 


The  following  theorem  given  one  particular  realization  of  T . 


Theorem  6 


Consider  T  as  in  Figure  2-7,  then  T  has  a  representation  in  the  form 


‘  a+b2f  -hc2 

-PD  M 

T  u  T 12 

0  A  +HC2 

B  i+HD  2i 

** 

T  2i  0 

* 

C  x  +  D 12  F  C  i 

0  °2 

D  21 

where  the  matrices  F  and  H  are  defined  as  in  Eqs.  2- 29a  and  2-29b. 


Proof 


(')  Tu: 


Substituting  Eqs.  2-8  and  2-29a  into  Eq.  2-24b, 


12 


A 

b2' 

|"i4  +  B2F 

If*] 

Cx 

D*. 

* 

L 

F 

rrJ 

A 

b2f 

B 2 

- 

0 

a+b2f 

B, 

D  i2 

F  1 

D  12 

Applying  the  similarity  transformation 


/  -/ 
0  1 


yields, 


A  0 

0  A  +B2F 

0 

b2 

'  A+B2F  ] 

B 

0 1  ^  D 12^  1 

~D 

0 1  C  x  +  D 

D  12 

(»)  r„: 


Substituting  Eqs.  2-8  and  2-29b  into  Eq.  2-24c, 


Applying  the  similarity  transformation 


i0“  [o  /] 


yields, 


A+HC 2  0 

B  i  +  HD  21 

A  +HCa 

T  21  *• 

0  A 

Bx 

— 

C2 

C2  0 

D  .2 

>i)  Tn: 

From  Eqs.  2-8,  2-31a,  and  2-35b, 


A 

h' 

A+B2F\-H 

A  +//C2 

c7 

D  \2 

* 

F  |  0 

* 

c2 

A 

B2F 

0 

0 

0 

A+B2F 

-hc2 

-HD21 

0 

0  A 

+  hc2 

B 

1  +  HD  21 

cT 

Dl2F 

0 

0 

1 

7  -/  -/ 

1 

Applying  the  similarity  transformation 

0  /  0 

yields, 

1 

0  0  / 

J 

‘ A 

0 

0 

-B , 

0 

A  +BS 

-hc2 

P  12^9  A/  P  2 1  ■ 

0 

0 

A  +HC2 

J3,  + 

21 

Pt 

C  i  +  D 12F 

c  1 

1 

0 

Therefore,  from  Eq.  2-24a, 

Tn 


P 11  "#■  P  12^1  M  P  21 

A 
0 
0 


A 

Si' 

,£T 

0  0 
A  +B2F  -HC2 
0  A  +HC, 


Bt  +  HD 

d7i 


i  ^  ^  ci 

D  21 


-HD  a 
,  +  HD, 


+ 


3 


A  +B3F  -HC2 

0  A 

+  hc2 

fl,  +  HD2i 

C  i  +  D  ijF 

Cl 

Dn 

(2-35c) 


Combining  Exp.  2-35a  through  2-35c  and  the  fact  that  Ta  “  0,  Eq.  2*34  follows  immediately. 


Remark 

The  following  expression  is  an  alternative  realization  for  T  which  was  given  by  Doyle  [D16j. 


A  +B2F 

-BaF 

Bi 

b2 

0 

A  +HC2 

Bt+HD2l 

0 

Cj  ■f  D 12 F 

“  D  \2F 

Dn 

D 12 

0 

C2 

D  a 

0 

It  is  important  to  note  that  the  closed-loop  transfer  matrix  is  simply  an  "affine”  function  of  the  con¬ 
troller  parameter  matrix  Q  and  that  the  coefficient  matrices  T,,  have  very  simple  realizations;  namely  as 
described  by  Exj.  2-34  (or  Ex).  2-36). 

Note  that  the  Youla  parametrization  and  associated  observer-based  controller  described  above  allow 
choice  of  the  matrices  F  and  H .  In  the  next  section,  specific  choices  of  F  and  H  will  be  presented  such 
that  the  affine  parametrization  in  Theorem  3  has  additional  properties,  namely  that,  T  l2  and  T 2i  are 
inner  and  co-inner  respectively.  The  algebraic  Riccati  equation  will  play  an  essential  role  in  obtaining 
such  a  parametrization. 


r  I 


S.3  Coprime  Factorisation  with  Ianer  Numerator 


Section  2.2  contains  a  summary  of  methods  for  finding  the  interconnection  matrix  J  so  that  the  sub¬ 


stitution  A-  »  F(  (J,Q)  yields 


F i(P,K)  -  F/(P,F({/ ,Q  )) 


F  ,(T.Q) 


Fn  +  F12<3ra 


with  the  additional  requirement  that  7  6  //»  and 


F|  (P  ,K )  internally  stable 


if  and  only  if  Q  6  H x  . 


This  parametrizes  all  stabilizing  controllers  K  in  terms  of  a  stability  matrix  Q  €  //«  in  addition  to 


providing  an  affine  parametrization  of  all  stable  F ( (P  ,K ).  The  actual  structure  of  J  was  derived  in 


terms  of  an  observer-based  compensator.  The  stabilizing  state  feedback  and  output  injection  of  the 


observer-based  compensator  were  shown  to  provide  coprime  factorizations  of  P&  and  solve  the  Bezout 


identities  necessary  to  provide  the  parametrization  of  all  stabilizing  controllers. 


In  this  section,  the  requirement  is  added  that  7I2  and  T21  be  inner  and  co-inner  respectively;  that 


is,  r,*rI2  -  /  and  T21T2t  ■■  /  .  In  addition,  and  f  ^  will  be  found  so  that  £ti2  TjJ 


Fji 

~  are  square  and  inner. 

1  1  J 


The  key  idea  behind  the  factorization  in  this  section  is  the  connection  between  inner  functions,  alge¬ 


braic  Riccati  equations  (AREs),  and  spectral  factorizations.  The  stabilizing  solution  of  the  ARE's  will  be 


needed  in  order  to  construct  the  desired  factorizations. 


Coprime  factorizations  with  inner  numerator  and  complementary  inner  factors  (C1F)  will  be 


obtained  which  involve  using  a  state  feedback  or  output  injection  gain  based  on  the  stabilizing  solution  of 


a  particular  ARE.  This  provides  a  reliable  computational  method  based  on  standard  approaches  to 


finding  solutions  of  AREs  [C5,K8,L1,M1,M2,M3,P3,V1|.  A  complete  treatment  on  this  subject  can  be 


found  elsewhere  [A3,A4,C2,D16,Wl,Yl|. 


s’  s'  s’ Vs 


p. mi ».’J «,i » jm* jmni.iVAW W'MU.  WA1  v. ~ s' " j. 


\ 

►V 


?;.'  v  ■.  “  ' "’S'  v  v/.v.  ’jAv  >  1 
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Consider  the  Algebraic  Riccati  Equation, 


EtX  +  XE  -  ATH^  +  Q  -  0 
with  the  associated  Hamiltonian  matrix 


(ARE) 


A#  =* 


£  -IF  ‘ 

-<3  -ErJ  ’ 


(Hamiltonian) 


where 

E,  w,  Q  en9Xn  ,  W  =«  Wr  >  0  ,  and  Q  =  Qr  . 

The  following  theorem  and  corollary  characterize  the  relationship  between  spectral  factorization,  AREs, 
and  decomposition  of  Hamiltonians.  The  proofs  can  be  found  in  [D16|. 


Theorem  1 

Let  A ,  B ,  P ,  S ,  and  R  be  matrices  of  compatible  dimensions  for  the  ARE  such  that 
P  PT  ,  R  RT  >  0,  with  the  pair  (A  ,B )  stabilizable  and  the  pair  {P  ,A  )  detectable.  Then  the 
following  statements  are  equivalent: 


(i)  The  para-Hermitian  rational  matrix 


n»)  =  [Br(-s/-Ar)-*  /]  J/r  1 


(bI-A  Y'B 

l 


satisfies 


T(;  w)  >  0  for  all  0  <  u  <  oc  . 

(ii)  For  E  =*A-B/?'lSr  ,  IV  =  BR~'BT ,  and  Q  =  P  -SR ~lS  T ,  there  exists  a  unique  real  matrix 
X  *  XT  such  that 

E T  X  +  XE  -  XWX  +<3*0 
and  (E-BR'xBtX)  is  a  stability  matrix. 


58 


(iii)  The  Hamiltonian  matrix 

Ah 


a-br~1st  -br-'bt 

-P  +SR~lST  -(A-BR-lST)T 


baa  no  eigenvalues  on  the  j  w-axis. 


Remark 

m 

The  unique  stabilizing  solution  of  the  ARE  (X  aa  in  (ii)  of  Theorem  1)  will  be  denoted  by  Rie  (AH ) 
throughout  where  Ah  is  the  associated  Hamiltonian  matrix. 


Corollary  3 

If  the  conditions  in  Theorem  1  are  satisfied  then  5  A/  6  Rp  such  that  Af'1  6  RH a 0  and 

T  -  M*RM  . 


A  particular  realization  of  one  such  M  is 


M 


A 

B 

-F 

/. 

(2-37) 


where  F  -  -  R'l(ST  +  BT X) . 


In  the  following,  the  special  form  of  coprime  factorizations  required  to  simplify  the  general  H a 
(a  as  2,  oo)  optimal  control  to  a  distance  problem  will  be  developed.  In  particular,  explicit  realizations 
are  given  for  coprime  factorizations  G  *  NM~l  with  inner  numerator  N  (Theorem  4)  and  for  the  com¬ 
plementary  inner  factor  which  completes  the  inner  numerator  to  make  A/jJ  square  and  inner 

(Theorem  5).  The  results  will  be  stated  as  theorems  for  re/  s.  The  duals  for  lef  s  following  just  as  for 
the  general  case  of  coprime  factorization  developed  Section  2.2. 

The  following  lemma  summarizes  the  necessary  and  sufficient  conditions  of  an  inner  transfer  matrix 


with  state-space  representation.  The  proof  can  be  found  in  |D16]. 


Lemma  3 


The  transfer  matrix  N  *  ^ 
(i)  BtX  +  D  TC  -  0 


A  B 

€  /?//«  i*  inner  if  and  only  if 


(ii)  DT D  l 

where  the  observability  gramian  X  solves 

A  T  X  +  XA  +  C  rC  -  0 


For  the  next  two  results,  it  is  assumed  that  G  — 


€  R/xm  and  the  realization  for  G  is 


minimal.  The  notation  Rp*  (Rp  >  0)  will  be  used  to  denote  the  square  root  matrix  such  that 
(Rpn)T Rpn  “  Rd  (or  Rd^Rd^V  “  Rp)  and  use  for  any  orthogonal  complement  of  D  so  that 

I/s  D  |  j  (with  Rp  **  D  T  D  )  is  square  and  orthogonal. 

From  Lemma  2.2.4,  a  stabilizing  state  feedback  F  yields  ref  G  =*  NM  l  where 


A +BF  BZ 
F  Z  . 
C+DF  DZ 


and  Z  can  be  any  nonsingular  matrix.  To  obtain  an  ref  with  N  inner,  we  simply  need  to  use  Eqs.  2-38 
through  2-41  to  solve  for  F  and  Z .  This  yields  the  following  result. 

Theorem  4 

Assume  p  >  m  .  Then,  there  exists  an  ref  G  »  with  X  inner  if  and  only  if  G  *G  >  0  on 

the  ;u- axis,  including  at  oo.  This  factorization  is  unique  up  to  a  constant  orthogonal  multiple.  A  par¬ 
ticular  realization  for  the  factorization  is 
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M 

N 

A  +RF 

BRD-'n' 

- 

F 

rd 

C+DF 

DRD-lfl 

(*-42) 


where 


Ro  -  DtD  >0 

F  -  -Rd-\BtX  +DtC)  (2-43) 


and 


Rie 


A  -BRd-1D  t  C  -BRd-1B  t 
-CrD1DlrC  ~(A-BRd1Dt  C)T 


>  0 


(2-44) 


In  a  similar  manner  Eqs.  2-38  through  2-40  can  be  used  to  obtain  the  complementary  inner  factor 
(CIF)  in  the  following  theorem. 


Theorem  6 

If  p  >  m  in  Theorem  4.  then  there  exists  a  complementary  inner  factor  (CIF)  N ^  €  RH 
such  that  the  matrix  f iV  is  square  and  inner.  A  particular  realization  is 


A  +BF 

-x'ctd1 

C+DF 

D1 

(2-45) 


where  X  and  F  are  from  Theorem  4  and  is  the  pseudo-inverse  of  X  . 


Remarks 

(1)  If  G  €  RHl oX"  in  Theorem  4,  then  M  is  a  "unit"  in  /?//«,  and  Af*1  is  "outer".  In  this  case, 
G  *  N  (A/-1)  is  called  an  "inner-outer  factorization"  (IOF). 

(2)  Dual  results  for  all  factorizations  can  be  obtained  when  p  <  m  .  In  these  factorizations,  output 
injection  using  the  dual  Riccati  solution  replaces  state  feedback  to  obtain  corresponding  left  fac¬ 


torizations. 


so 


fjf* 


In  the  rest  of  this  section,  the  results  of  the  Youla  parametrization  which  parameterizes  ail  stabiliz¬ 
ing  controllers  and  coprime  factorization  with  inner  numerator  will  be  combined  in  a  way  that  is  con¬ 
venient  for  solving  //optimal  and  H ^-optimal  control  problems. 

Assume  P  as  the  transfer  function  matrix  associated  with  Eqs.  2-2  and  2-33  and  neither  P12  nor  P  a 
has  transmission  zeros  (see  reference  [Kl]  for  definition  of  transmission  zeros)  on  the  j  i^-axis  (including 
oo).  This  implies  that 

Rp— D^D  12  >0  and  /?/>■■  02jPar  >  0  • 

Under  these  assumptions,  let  D ^  —  (Z)12)j  and  “  (Pa)|  .  that  is,  ^Z)l2Pp~1'*  and 

a  j  are  orthogonal  matrices  where 


(PDW) r  P0W  -  Rd  and  R  %\R  ?) r  -  R  „  . 

By  Theorem  4,  S  l3  =  T  12P D'lft  is  inner  where  the  state  feedback  gain  matrix  F  is 
F  =  -  Rpl[D  i2C i  +  BfX) 


A  —  B2/?£j  12 C  j  —  J32/?p*B2* 

- ClD1DTiCx  -(A  -B2RD-'Df2Cx)T 


(2-46aj 


(2-46b) 


Similarly,  using  the  dual  form  of  Theorem  4,  iV2l  £  R  p^T 21  is  co-inner  where  the  output-injection 


gain  H  is 


H  -  -(BiDl  +  YCl)R  d 


(2-47a) 


Y  -  Ric 


(A  -B,DlRBxC 2)r 

-BxD?DxBj 


-  C2rP  5lC2 

-(A  -BxdIxRb'C2)  ■ 


(2-47b) 


Then,  N ,2/V  12  /  and  /V21iV2*  »■  /.  Also,  let  N ^  and  jV^  be  the  corresponding  CEFs  so  that 


» 


A  +  B  2 F 

b2rd -w 

-x'c{dl 

Cx  +  DnF 

durd-1* 

*>l 

Soy' 

’  /t  +  WC2 

B  j  +  //Z?  21 

i  2 1 

/V, 

R  dwC2 

1  J 

-dlb[y' 

5i  . 

Therefore, 


(2-48) 

(2-49) 


f  ap.k)  =  f,(7\q) 

=  7"  u  +  Tl2QT2i 

=  T n  ~  N  u(  -  Rq^QR  o^)iV21 

=  Tu  -  N nQ  jV21  (2-50) 

where 

Q  =  -  Rd'QR  o  j  €*/f.  (2-51) 

is  the  new  stable  parameter  matrix. 

Because  both  the  ||  •  ||2  and  |J  •  1 1<„  norms  are  unitarily  invariant,  an  alternative  expression  is  pos¬ 
sible.  For  any  Q  €  /?//<*,  , 


"  N X2Q  N 2X 


R\\-  Q  R  is 
R  a  /?  22 


r  i  *  . 

• 

^jV12  A^|  J  (  Tjj  -  N l2Q  S a 

A. 

N&T uiVjJ  -  g  /V^rnfVj*' 

.  iVA*r„Afa  Afi*r„iv1*< 

where 


R 11  ^  12 
/?  21  /?  22 


V  * 

<v  12 

.v 


1  J 


['V:*  A';] 


(2-52) 


R 


(2-53) 


Now,  the  a  »  2  case  is  particularly  simple.  Since 


R  u  -  Q  R  i2 

R  21  R  22 


the  optimal  Q  is  found  immediately  to  be 


Rn- Q  2  + 


#  21  ^  22  J  2 


Q  =*  P h 3{R n)  (2-54) 

and  the  the  corresponding  optimal  Q  is 

Qcpt  “  -  Rp  '^PujiR  n)R  DXn  (3*55) 

The  case  of  a  =*  oo  is  more  complicated  and  the  formulation  in  Eq.  2*52  is  called  the  'general  dis¬ 
tance  problem”  (in  short,  GDP)  which  will  be  investigated  in  great  detail  in  the  next  three  chapters. 

A  state-space  realization  for  R  is  given  in  the  following  theorem.  It  is  suggested  that  all  the  poles 
of  R  are  in  the  open  rhp  ,  i.e.,  R  is  completely  unstable.  For  convenience,  R  is  represented  as  the  cas¬ 
cade  operation  from  two  system  matrices. 

Theorem  6 

Consider  R  from  the  formulation  in  Eq.  2-53,  then  R  has  the  realization 

-(A  +  B2F)T  j(<7t  +Dl7F)T  -XH]  \  -(A  +HC2)T  I  -(Rp^C^  Y'BXD  ,r 


(B^p-^Y  (Dl2Rp-”y  0  *  C,Y  +Dll(Bl  +HD2l)T  Dn[R  p~'»D2l)T  D  nD 

- D\CxX T  D[  0  0  Rf  0 

r 

Note  that  the  realization  of  T  „,  Eq.  2-35c,  can  be  expressed  as  the  cascade  of  two  systems,  i.e. 

A+B2F  -HC2  - HD2X 

T *  0  A  +  HC i  B\  +  HD 2i 

C \  +  D  i ■tF  Ci  D  ii 


A  +B2F 

-hc2 

0 

A  +  HC3 

C  i  "¥  D  12^ 

c , 

A  +  2^  I 

O 

1 

0 1  +  D 

/  0  J  * 

( T n)yi  ( T h)b 


It  is  convenient  to  compute 


(Tu)a 


yVj*j  separately. 


‘N& 

-(A  +B2F)T 

(C 1 

+  DX2F)t' 

‘  A  +B2F  10  -H 

(T  nU  - 

-(B2Rd-'*)t 
D[CxX ' 

(D 

«Uo“^r 

DI 

Cx  +  DnF\  I  0 

-(A+B2F)t  {Cx  +  D12F)t(C1  +  DuF) 

(Ci  +  D  &F ) 

0  A  +B2F 

0 

-(B2Rd-^)t  (D12Rd-1*)t(Ci+D12F) 

(d12rd-1*)t 

D[CxX t  D[(CX  +  DX2F) 

m 

Dl 

-(A+B2F)t 

C,PIDirC,  +XB^dxB2X 

{C\  +  D  aF ) 

0 

A  B  2F 

0 

zm 

-(BA'T 

~(B2Rd-^)tX 

(■ DnRD-'n)T 

D[CtX ' 

D[C, 

DI 

Applying  the  similarity  transformation 


/  0 
-X  I 


(AT  is  as  in  Eq.  2-46a)  yields, 


-( A+B3F)t 
0 


(C,  +DuF)t  -xh 
-X(Cx+D12F)t  -h 


(*) 

A  +B  2? 


Ker(X)  C  Kw(DfCi)  — «>  0(0^1 -XX')  =  0  . 


-{A  +BiF)T 

(C^D^F)7  -XH 

• 

•  • 

(r„)a  “ 

-(B3Rd-'*)t 

DfCiX1 

(d12r0-^)t  0 

Bf  0 

(b) 


a+wc2 

fl,  +  BBa 

c, 

Dn 

c2 

D  22 

'  -(A+HC2)t 

(Rd^Cz)7  -Y'Bjf 

* 

[-(B1  +  HD21)t 

(*DVaD2l)T  D  7 

A  +HC2  -(Bt  +  HDn)(Bx  +  HD2l)7 

(B,  +  HD  2l)D  7\  (B,  +  HD2l)D 

0  ~(A+HD2l)T 

(Rp^C,)7  -  YyBxD  7 

C,  -  D  n(Bt  +  HD2\)7 

DniRo^Dz,)7  DnD  7 

C2  Ba(Bi  +  HD  a)7 

R  &  0 

A+HCz 

~{B\D  *D ^B7) 

-YC7 

B\D  7 

0 

-\A  +HC2)T 

(Rp^Czf 

-  Y^B\D  7 

-  Du(Bt  +  HD2i)7 

D^Rp^Dz,)7 

DnD7 

C2 

-c2y 

Rf 

0 

Applying  the  similarity  transformation 


V  Y 
0  / 


(Y  is  as  in  Eq.  2-47a)  yields, 


(J"n)i  [^2i  N{] 


a+hc2 

(©) 

0 

(I -YY')B iD  7 

-(A  +HC2)t 

(Rp'^CzV 

-  r’B,/?/ 

c , 

-  CtY -Dll(Bl  +  HDzi)7 

Dn(RoV7D2  ,)r 

DnD7 

c2 

0 

R? 

0 

where 


(©)  »  -  {(A+WC2)K  +  Y(A+HC2)T  +  fl,0'IrPiB,r  +  KC2rB5lC2r[ 

-  -  {(*-0,I>ar,J?5IC2)r  +  Y{A-B \DlxR  plC2)r  -  YClRBlC2Y  +  BlD1rDi  B,rJ 


•V  /  V  ** 


04 


0  . 


Also, 


Ker(K)  C  Ker(0  Lfl,r)  — >  (/-IT^BiJJ/— 0 


(Tu)t 


[a21  N*] 


-{A  +HC2)t 

-(RD-l*c2)T  Y'BxD{ 

C  ( Y  +  D  u(fl  i  +  HD  2i ) r 

0 

Rf  0 

•  (2-57) 


Combining  Eqs.  2-56  and  2-57,  the  theorem  follows  immediately. 

QED 

Remark 

This  realization  is  not  unique,  An  equivalent  expression  can  be  found  in  [D16|. 

It  is  clear  that  R  has  all  of  its  poles  in  the  open  rkp  .  Thus,  the  projection  onto  H2  +  <£  leaves 
only  the  constant  term.  Therefore,  in  the  H j  case  one  has  the  following  result. 


Corollary  7 

The  Hj-optimal  solution  is 

Q.„  -  -Ro'D&DnD&R  S'  (2-58) 

In  the  Woo  case,  the  solution  of  the  general  distance  problem  is  not  trivial  which  will  be  discussed 
extensively  in  the  next  three  chapters.  To  illustrate  the  approach  described  in  this  chapter,  two  examples, 
corresponding  to  H2  and  H &  optimization  respectively,  will  be  presented  in  the  next  section. 


2.4  Examples 

In  this  section,  two  examples  will  be  presented.  In  the  first  example,  the  familiar  linear  quadratic 
Gaussian  (LQG)  formulation  is  treated  using  the  general  optimal  control  theory  described  in  previous 
sections. 


Example  1<  (LQG) 


Consider  a  linear  time-invariant  system  with  the  following  stochastic  description 


x  tm  Ax  +  Bu  -f  Gd 


y  =*  Cx  +  Nn 


(2-59a) 

(2-S9b) 


.4,  G  €  R"x",  B  €  R"x",  C  €  R'**  and  N  £  W*'"  , 

(A  ,B  )  is  controllable  and  (C ,A  )  is  observable, 

E[d(r)|  *  0,  E  [<f(0</r(r)]  - 
E  |n(<)J  -  0,  E  Jn (< )n  r(r)J  -  /$(<-r), 
and 

N  is  nonsingular. 

The  standard  LQG  formulation  involves  minimization  of  the  (expectation  of)  quadratic  cost  func¬ 
tion,  that  is,  minimization  of 


E  J{  ||  *  III  ♦  ||  u  ||*  )dt 


with  respect  to  x  and  u.  In  addition,  the  factorizations  Q  =**  LT  L  >0  and  W  ==*  MTM  >0  are 
assumed.  A  block  diagram  representation  is  shown  in  Figure  2-8  which  can  be  further  rearranged  into  the 
general  synthesis  framework  suggested  in  Figure  2-9. 


(si- A)1 


r 

L$G  0  L$B 

d 

t; 

— 

0  0  M 

n 

.V, 

C$G  N  C$B 

u 

The  interconnection  matrix  P  of  Figure  2-9  in  partitioned  form  is  simply 


P 


>ti 

P 12 

P  21 

P  22 

L$G  0  L$B 
0  0  M 

C$G  N  C$B 


and  has  the  following  realization 


A 

GOB 

L 

0  0  0 

0 

0  0  M 

C 

0  N  0 

By  assumptions,  (A  ,B )  is  a  controllable  pair  and  (C  ,A  )  is  an  observable  pair.  Therefore,  according 
to  Section  2.3,  the  state  feedback  gain  matrix  F  and  observer  gain  matrix  H  can  then  be  found  by  solv¬ 
ing  the  appropriate  AREs.  Explicit  solutions  for  F  and  H  are: 


(i)  F  *  -  (A/rA/)-‘ 


{  [o  m]  [Jj  1-  B'x] 


-  (MrM)-'BrX 


A  -B(MTM)'lBT 
where  X  "•  RIc  £ r  £  \ r 


(ii)  H  -  -  {  [<7  oj  v°r  +  YC T  |(ArArr)_l  -  -  YCt(NNtYx 


f  At  -CT  (NNt  yl  c] 
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Hence,  from  Eq.  2-32a,  the  matrix  J  has  the  following  realization: 


J 


\A 


+BF+HC 

F 

-C  i 


-H  B 
0  / 
/  0, 


Since  the  external  inputs  are  assumed  to  be  white  Gaussian,  and  the  performance  is  measured  in 
terms  of  the  minimum-mean-square-error  (MMSE)  criterion,  it  can  be  shown  that  this  is  an  H2  optimal 
control  problem.  Therefore,  from  Corollary  2.3.7,  the  H j-optimal  solution  is  * 

Q.„  “  ~(MT M )'*  [o  Mt]  [j  °]  ] (NNT)~l  -  0. 

The  optimal  stabilizing  controller  Klft  can  be  computed  from  the  LFT  F and  has  the  following 
realization: 


K,,.  -  F,(/.0)  -  /„ 


A+BF+HC 

-H 

F 

0 

It  is  not  difficult  to  identify  that  the  transfer  function  of  optimal  controller  K,,t  is  exactly  the  same 
as  that  derived  from  the  traditional  LQG  theory.  This  means  that  the  LQG  formulation  can  be  handled 
easily  in  the  general  synthesis  procedure  presented  in  this  chapter. 


The  next  example  is  a  //-^-optimal  control  problem. 

Exnznpl*  St 

Figure  2-10  shows  a  familiar  feedback  control  formulation.  The  external  inputs,  commands  and  dis¬ 
turbances  at  inputs  of  the  plant,  are  assumed  to  be  L^-bounded  signals.  The  performance  is  measured  in 
terms  of  the  energy  of  weighted  tracking  errors  (sensitivity  minimization)  and  weighted  control  input  sig¬ 
nals  (preventing  actuator  saturation). 


Figure  2-11 


By  assumptions  on  the  input  signals  and  the  performance,  it  is  known  |D16,F4,F5,Z2,Z3|  that  Ibis  is 
an  H  *  optimal  control  problem  and  the  general  distance  problem  arises  (Eq.  2-52). 


An  interesting  fact  is  that  the  matrix  R  does  sot  necessarily  have  the  2X2  block  structure  as  in  Eq. 

r*  12 1 


2-53.  For  some  control  problems,  only  two  blocks  appear  in  the  GDP,  that  is,  either 
^2i  2* J  ■■  0.  This  can  be  seen  clearly  from  this  example. 


1  22 


0  or 


Note  that, 


Then, 


0 

0 

°  r 

1 

0 

p  11  - 

-  w2g 

w2 

"  W,  [~G  l. 

1  - 

w3 

21 


Tn  -  Pn  +  PnU,MP 


21 


+  Pl2U.M)  P 21 


(from  Eq.  2-24a) 
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(from  Eq.  2-24c) 


where 


and 


From  Eq.  2-53, 


Therefore, 


This  is  a  special  general  distance  problem  and  will  be  referred  as  "2-block  GDP.  Both  the  2-block  and 
the  full  2x2  block  cases  will  be  discussed  in  later  chapters. 


CHAPTER  3 

GENERAL  DISTANCE  PROBLEMS 


This  chapter  is  devoted  to  a  summary  of  results  for  the  "General  Distance  Problem"  (GDP): 

Rn  R 12 

Given  R  =*  p  P  6  L&  ,  find  an  element  Q  €  //„  such  that 

R  n~  Q  R 12 

/?2,  oo 

is  minimized.  Note  that  the  minimum  value,  denoted  as  7, ,  is  the  distance  from  R  to  the  set  of  (matrix) 
functions  of  the  form 


Q  0] 

0  0  1  ^  ^  ^00  • 

This  formulation  of  the  GDP  is  called  the  "4-block  problem”  to  distinguish  it  from  the  specialcases 
r  l  R  12 

where  l/fji  ^22!  or  ^  is  identically  zero.  This  special  case  will  be  referred  as  the  "3-block  prob¬ 
lem".  Note  that  if  both  f?2aj  and  are  zero,  this  formulation  is  known  as  the  "best  (or 

Hankel)  approximation”  problem  [A1,A2,B1,G2]. 


The  GDP  can  also  be  regarded  as  a  matrix  dilation  problem  with  the  constraint  of  causality  on  Q . 
The  subject  of  constant  dilation  will  be  reviewed  first  in  Section  3.1  where  Parrott's  theorem  on  norm¬ 
preserving  dilations  (Dl,Pl,P4|  will  play  a  central  role  in  establishing  the  existence  of  the  optimal  solution 
of  the  GDP.  The  Hankel  operator  and  its  relation  in  best  approximation  is  then  treated  in  Section  3.2 
using  Parrott's  theorem.  Section  3.3  introduces  the  Toeplitz  operator  and  some  of  its  algebraic  properties 
which  provide  insight  into  the  GDP.  Generalizing  the  approach  in  Section  3.2,  an  abstract  operator 
point  of  view  is  adopted.  The  existence  of  an  optimal  Q  and  expressions  for  the  optimal  norm  are 
detailed  in  Section  3.4.  These  expressions  are  given  in  terms  of  an  operator  norm  or  an  equivalent  stan¬ 
dard  eigenvalue  problem.  The  operator  and  eigenvalue  problem  are  of  infinite  rank  and  the  results  do  not 
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yield  computable  formula*  for  either  the  optimal  norm  or  Q  .  Nevertheless,  the  existence  of  the  optimal 
solution  is  established.  A  more  practical  approach  will  be  discussed  in  chapter  4. 

The  results  of  this  section  and  the  next  are  more  easily  obtained  using  functions  on  the  unit  disc 


instead  of  the  half-plane.  Since  there  is  a  well-known  isometric  isomorphism  between  and  H ^  on  the 
half-plane  and  the  disk  (see  Appendix  B),  the  general  distance  problem  on  the  disk  will  be  considered 
throughout  this  and  the  next  chapter. 
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3.1  Dilation  of  Constant  Matrix 


Consider  the  optimization  problem  of  finding 


where  .V,  B ,  C ,  and  A  are  constant  matrices  of  compatible  dimensions.  The  matrix  Aj  *s  a 
lion  of  its  submatrices  as  indicated  in  the  following  diagram: 


1  1' 
M 


Here  c  stands  for  the  operation  of  comprestion  and  d  stands  for  the  operation  of  dilation  .  Compression 
is  always  norm  decreasing;  however,  dilation  can  sometimes  be  made  to  be  norm  preserving.  Norm 
preserving  dilations  are  the  focus  of  this  section.  Since  it  is  not  the  purpose  of  this  work  to  review  the 
related  theory  throughly,  only  useful  facts  will  be  presented;  more  complete  treatment  on  this  subject  can 
be  found  elsewhere  [D1,D16,P1,P4], 


The  simplest  matrix  dilation  problem  occurs  when  finding 


Although  Eq.  3-3  is  a  much  simplified  version  of  Eq.  3-1,  it  contains  all  the  essential  features.  It  is 
immediate  that  7*  “  1 1  A  1 1,*.  The  following  theorem  characterizes  all  solutions  A'  to  Eq  3-3. 
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Theorem  1 

V7  >  7,  , 


if  and  only  if 


a: 

A 


1 00 


<  7 


3  Y  with  ||  Y  II*  <  1  (i.e.,  Y  is  a  contraction)  such  that  X  =•  Y (^1  -A*A  )Vt. 


This  theorem  implies  that,  in  general,  Eq.  3-3  has  more  than  one  solution.  X  —  0  is  the  central  solution 
but  others  are  possible  unless  A* A  =»  72/.  A  more  restricted  version  of  the  theorem  is  the  following 
corollary. 


Corollary  3 

V  7  >  7.  , 


X 

A 


II  W/-AM)-,'4||*  <  1 


The  corresponding  dual  results  are  stated  as  the  following  theorem. 


(3-4) 


Theorem  3 

V  7  >  7.  . 

IIIA-  A  ]  ||*  <  7 

if  and  only  if 

3  Y  with  ||  Y  II*  <  1  (i.e.,  Y  is  a  contraction)  such  that  X  =  (i‘I -A.\*)inY. 

Corollary  4 


V  7  >  7, 
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III*  *111.  <1  **  II.  <  1.  (3-5) 

The  following  theorem,  usually  attributed  to  Parrott  [Par),  characterizes  7,  and  will  play  a  central 
role  in  establishing  the  existence  of  the  optimal  solution  to  the  best  approximation  and  general  distance 
problem. 


Theorem  5 


7.  —  max  ||  1  C  A  ] 


(3-6) 


As  in  Eq.  3-3,  there  may  be  more  than  one  solution  to  Eq.  3-1.  The  following  theorem  parametrizes 
all  solutions  to  Eq.  3-1. 


Theorem  5 

Suppose  7  >  7, .  The  solutions  X  such  that 

\x  B 
[(7  A 

are  exactly  those  of  the  form 

X  -  -YA*Z  +  7(/->T*)wWr(/-Z*Z)1*  (3-8) 

where  W  is  an  arbitrary  contraction  (||  W  U®  <  1)  and  Y  and  Z  solve  the  linear  equations 

B  -  Y(72l-A*A)ln  (3-9) 

and 

C  =»  (t2/  -AA  •)**£.  (3-10) 


<  7 


(3-7) 


The  following  corollary  gives  an  alternative  version  of  Theorem  6. 


Corollary  7 


V  1  >  7«  . 


.V  fl” 
C  A 


<  7 

00 


il  and  only  if 


||  (/-lT*)-^  +  KMZ)(/-Z*Z)-W  |U  <  7 


where 


(3*11) 


(3-12) 


K  -  Btf[-A*A)-i/a 

Z  -  tfl-AA'Y^C.  (3-13) 


There  are  many  alternative  characterizations  of  solutions  to  Eq.  3-11,  although  the  formula  in  Eq. 
3-12  seems  to  be  the  simplest. 

The  restriction  that  7  >  7.  in  Corollary  7  doec  introduce  some  loss  of  generality.  If  these  alterna¬ 
tives  to  Theorem  6  are  used,  it  is  not  possible  to  get  all  solutions  for  7  «•  7,  .  The  set  of  all  solutions 
arbitrarily  close  to  the  optimal  7,  is  the  best  that  can  be  done.  The  reason  for  considering  this  special 
case  is  that  Eq.  3-12  has  a  reasonably  straightforward  generalizations  to  the  rational  matrix  case,  whereas 
Eq.  3-8  does  not.  A  further  difficulty  with  the  rational  case  is  that,  unlike  the  constant  case,  it  appears 
that  it  is  not  possible  to  actually  compute  7,  exactly.  This  makes  our  inability  to  obtain  a  direct  general¬ 
ization  of  Theorem  6  seem  less  critical,  at  least  with  respect  to  application  of  this  theory. 
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3.2  Hankel  Operator*  and  Beat  Approximation  Problems 

Two  classes  of  operators  on  a  Hilbert  space,  Hankcl  operators  and  Totplitz  operators,  have  played 
an  important  role  in  function  theory  on  the  unit  circle.  In  recent  years  the  theory  of  Hankcl  operators 
has  attracted  increasing  attention  in  the  areas  of  control  and  systems  theory,  mainly  due  to  its  applica- 
tions  in  model-reduction  and  best  approximation. 

The  best  (or  Hankcl)  approximation  is  reviewed  in  this  section  where  the  Hankcl  operator  plays  an 
essential  role.  The  problem  is  stated  as  follows: 

For  given  (matrix)  function  G  6  L  *  ,  finding  a  function  Q  6  //»  such  that 

| )  <7  -  Q  1 1 jo  is  minimized. 

This  problem  was  first  solved  by  Nehari  for  general  scalar  discrete  systems  JN1].  The  multivariable  ver¬ 
sion  was  later  solved  completely  by  Adamjan  ct  al.  |A1,A2).  Recently  Glover  considers  a  special  case  of 
the  problem  where  the  function  to  be  approximated  is  real-rational,  i.e.,  G  6  RLX.  An  efficient  and  con¬ 
structive  algorithm  for  the  optimal  solution  was  constructed  in  terms  of  a  balanced  realization  of  the  sys¬ 
tem  [G2|.  For  the  real-rational  case  this  is  a  most  effective  algorithm.  A  simple  proof  of  existence  of  the 
minimizing  solution  Q  is  possible  using  a  Parrott/Davis-Kahan- Weinberger  theorem  on  norm-preserving 
dilations  [015,D16,P4|.  This  approach  is  quite  elegant  and  will  be  used  later  in  the  proo's  related  to  the 
GDP. 


Assume  that  G (z  )  belongs  to  iB(I)  with  the  power  series  expansion 

G(z)  -  £  G,  z' . 

I  ■  -0D 


The  following  definitions  will  be  used  throughout  this  chapter. 


Definition  :  (Multiplicative  or  Laurent  Operator) 


The  Multiplicative  (Laurent)  operator  Me  generated  by  G  (z  ),  is  defined  as 

Mg  :  HT)  -  L*T) 

/  —  MaJ  -  G  / 


Definition  :  (Hankel  Operator) 

The  Hankel  operator  He  generated  by  G  (z ),  is  defined  as 

He  :  H4T)  -  H2L(T) 

I  -  Hc/  -  (PHiMc)/ 

Remark 

A  matrix  representation  of  He  is 

G-\  G  -i  <7 -a  . 

G  _j  G  _j  •  . 

•  •  . 

•  •  •  . 

•  •  •  . 

•  •  •  . 

which  is  the  well-known  (infinite)  Hankel  matrix. 


Theorem  1  (Beat  Approximation) 

Consider  the  following  minimization  problem: 

7.  *  min  ||  G  -  Q  |L 

Q  6  Hm 


Then, 


7.  — 

and  the  minimum  is  achieved  by  some  Q  6  H 


1 1  »e  || 


(3-14) 


Proof 


fd 
\  V  s  S 


The  inequality 


»iw  ■!■<'■'  imi  ft'^r.  y.T1 


Cs-‘ 


iy 


.% 

V 

> 

-V 


A' 


Inf  {||C  -  Q  lloo  :  <?  6tf»(r)J  >  ||  Hc  || 
is  easy  to  establish.  Fix  Q  in  //*(  7* ),  then 

II  C  -  Q  II.  -  »«P  {||(G  -  Q)f  ||*:/  eHJT),  \\  f  ||2<l) 

>  sup  jUP^ilC  -  Q)f  1 12 :  /  €  H£T),  ||  /  1 12<  1 
«  sup  j  1 1  Phl  Gf  ||,:/€W).||/  ||2<l| 


Hc 


and  take  the  infimum  over  Q  . 

Since  /  €  H^T ),  it  can  be  written  as  the  power  series  expansion 

/(*)  -  £  /.*• 

1^) 

and  let 

*(*)  -  » a/  =  PHi(Gf  )  -  £  h,  z ' 

Then  the  matrix  representation  of  He  can  be  expressed  as 


-1 


'*-1' 

<?-» 

G-2  . 

7.  ' 

A -2 

G-t 

G+  . 

/. 

G 

G  -4  . 

I  2 

• 

• 

•  . 

• 

• 

• 

•  . 

• 

• 

• 

•  . 

• 

This  is  exactly  the  matrix  representation  of  the  Hankel  operator  generated  by  G  . 


Now,  it  is  sufficient  to  show  there  exists  a  function  Q  in  H^T)  such  that 


lie  -  Q  IL  -  II  Hc  II  . 

By  a  result  of  Parrott’s  [P1,P4],  there  exists  a  Q0  such  that  the  norm  of  the  Hankel  matrix  is  preserved, 


i.e.. 


II  Hc 

Similarly,  there  exists  a  Q,  such  that 

G.-Q,  G.i 
G-\  G.  2 


G.-Q.  G 

G-i  G-2 


<M-<?  l 

G.-Q 

G.-Q , 

G-i 

G-i 

G.t 

Continuing  in  this  way  gives  a  suitable  Q  (r )  =  £  Q,z'  €  //<*,(  7")  such  that  the  minimal  norm  is 

■ 

achieved. 


QED 


The  following  corollary  is  immediate. 

Corollary  1 

II  Hc  ||  -  (3-15) 

where  Xn^jHc  *HC|  is  the  maximum  eigenvalue  of  the  matrix  [He  *HC|- 

Note  that  if  G  is  real-rational,  then  there  exists  an  optimal  solution  Q  G  RHX.  Related  materials 
are  presented  in  Chapter  5  where  an  algorithm  for  constructing  the  optimal  Q  is  given. 
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3.3  Toepllts  Operator* 

For  the  GDP,  the  solution  is  more  complicated  and  both  Hankel  and  Toeplitz  operators  are 
involved.  Therefore,  a  overview  of  the  necessary  properties  of  Toeplitz  operators  will  be  established  using 
simple  algebraic  procedures.  More  details  on  Toeplitz  operators  can  be  found  in  standard  publications 
[B5,D7,D8,S3|. 

The  definition  of  Toeplitz  operator  is  given  in  the  following. 

Definition:  ( Toeplitz  Operator) 

The  Toeplitz  operator  generated  by  G(z  )  6  L^T)  ,  is  defined  as 

T*  :  H^T)  -  HAT) 
f  -  T c/  -  (PHjMc)/  . 

Remark 

A  matrix  representation  of  Tc  is 

Gq  G  i  G 2  . 

G_,  G0  Gi  . 

G-i  Go  . 

•  •  •  . 

•  •  •  . 

•  •  •  . 

which  is  the  well-known  (infinite)  Toeplitz  matrix. 

In  the  following,  some  of  algebraic  properties  of  Toeplitz  operators  are  reviewed.  The  first  lemma  is 


well-known. 


Lemma  1 


t|Tc  ||  -  ||  Me  ||  -  || 

Proof 

The  proof  can  be  found  in  |B5,D7|. 

Lemma  2 

Tr*  “  T 

Proof 

V  /  .  g  €  H^T), 

<  Tc./  ■  J  )  -  (Ph,Mc./  ,  9  > 

”  <  /  -  McPHj$  > 

-  <  /  i  Mc  9  > 

*“  <  Ph2/  -  Me  9  > 
**  <  /  ,  PhjMc  g  > 

-  <  /  ,  Tc  g  > 

-  <  Tc'/  ,  9  > 

Tc.  =  Tc*. 


Definition: 


The  operator  9<j  generated  by  C  (r  )  €  £<J  T)  is  defined  by 


Lemma  3 


SC  -  H*.. 

Proof 

V  /  6  ff 2  (T)  and  g  6  H£T), 

<  S a!  ,g>  -  <PHjMc;/  ,  g  > 

-  <  /  ,  M«*P Hag  > 

-  <  /  .  Mc.;  > 

“  <  PHji/  .  M(;.f  > 

•  <  /  .  > 

-  <  /  .  > 

-  <  H  *./  ,  f  > 

S,;  -  H*. 

Lemma  4 

Assume  <fr,  ♦  €  Loo(T).  Then 

(i)  T**  -  T*T*  -  H£H*.  (Sarason) 

(ii)  T*„  —  T*T+  if  either  ♦•or*  belongs  to  H ^(T ).  (Brown-Halmos) 

Proof 

(i)  V  /  6  HAT), 

T  $♦/  —  P 

-  PhM^M*/ 
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-  PHjM*{Ph,  +  PHai}M*/ 

-  (Ph,M*PHjM*)/  ♦  (Ph8M*PhiM*/  ) 

-  (PH  M*PHj)(PhM*)/  +  (phm*pH2i)(p„m*)/ 

-  (T*T*  +  S*H*)/ 

From  Lemma  3,  S«  ™  and 

T$*  -  T*T*  +  H£H* 

(ii)  If  either  or  ♦  belongs  to  H^T),  then 

■  0  or  H*  *■  0. 

From  (i),  The  equality  follows  immediately. 

QED 

Remark 

Lemma  4  shows  that  the  Toeplitz  operator  does  not  have  the  multiplicative  property  in  general 
except  for  the  case  of  Lemma  4-(ii). 

Definition: 

The  operator  T<;  generated  by  G(z)  6  Lotlf)  is  defined  as 

T c  :  H3l(T)  ->  H3L(T) 

f  -  Taf  -  (P„iM G)f 


Remark 


The  operator  Tc  is  unitarily  equivalent  to  the  Toeplitz  operator  by  means  of  the  unitary 
operator  U  which  is  defined  as 


U  :  H2l(T)  -  HAT) 
U  /  =  zf* 


that  is,  TC,U  -  U  Tc  [D9|. 


Lemma  5 


The  multiplicative  (Laurent)  operator  can  be  represented  as 


Mg 


Tc  Hc 
Sc  Tc 


Proof 

Let  L2  H  2  @H2,  then 


Mc  :  Hj  QH 2  -»  Hi  QH2 


M  e 


Tc  Hc  U 
Sff  T (7  /  2 


where  fi€H2  and  f  2€  H  2. 
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3.4  Optimal  Solutions  of  General  Distance  Problems 


The  fundamental  issue  in  tbe  general  distance  problem  is  the  existence  of  the  optimal  solution.  In 
this  section  it  will  be  shown  that  the  optimal  solution  of  the  GDP  exists.  Since  internal  stability  (or 
equivalently,  causality)  is  required  ,  the  nature  of  the  problem  considered  will  be  very  different  from  that 
in  the  constant  matrix  case  (Dl,D16).  The  proof  is  essentially  a  generalization  of  that  for  best  approxima¬ 
tion  given  by  Doyle  [D15.D16]  where  the  Parrott/Davis-Kahan-Weinberger  theorem  of  Section  3.1  on 
norm-preserving  dilations  is  used.  In  case  that  R  is  real-rational,  it  can  be  shown  that  there  exists  a 
real-rational  optimal  solution.  The  2-block  GDP  will  be  studied  first.  The  results  are  then  generalized  to 
the  4-block  GDP.  Both  the  2  and  4  block  problem  have  been  studied  by  Feintuch  and  Prancis 
([F1|,(F2|).  Their  results  are  more  general  than  those  presented  here  because  they  consider  time-varying 
as  well  as  time-invariant  systems. 


Consider  the  following  2-block  GDP: 


•  « 

Q 

7  a  min 

<?  €»« 

R  - 

0 

where  R  =» 

00 

€  L  e 


Define  the  operator  T from  H  2  to  H  i  @  L  2  as  follows: 

r*  :  Ht  -*  Hi  QL, 


r  h! 


PH  ,1*./ 


R2I 


,  /  €  h2 


Theorem  1 

Consider  the  2-block  GDP  in  (2gdp),  then 


min 


r2 


lir, 


Proof 


(2gdp) 


(3-16) 


The  following  inequality  is  proved  first. 


*»-<?] 

*2  cc  “ 


111*11 


V  Q  6  Hoc, 


By-Q 


2  J  * 


(  (By  -<?)/  | 

•«P  (  p  /  :  /  €  Ht ,  ||  /  ||2  <  1  j 


PHl(^  i-<3)/  ! 

>  «UP  p  /  :  /  €  Hit  ||  /  ||j  <  1 


*V*./ 


:  /  €ff*.  ||  /  ||*  <  1 


lir*ll 


It  will  be  convenient  to  complete  the  proof  by  transforming  the  rhp  into  the  unit  disc  (see  Appendix  B). 
For  the  rest  of  the  proof  assume  that  this  has  been  done.  Since  Ry,  R2£  L^T),  they  can  be  written  as 
power  series  expansions: 


*i(*)  -  E  <***' 

i  —  -op 

b^)  *  £  fi>'k 

t  —  -CD 


/(*)  -  E  /.*'  2 


Now,  let 


MO  -  E<M.«‘ 


(«!-<?)/  p  ,  =  (*./)  p  ,  =  [  E  E  /.*'  I  P  . 

H2  ”2  (  *  —  -  oo  1—0  J  Kf 


Ising  these  expression  for  A  1(2)  and  ),  Eq.  3-16  can  be  written  as  the  following  equivalent  matrix 
equation: 


[(Ai)-i' 

a-i 

0-2  O.j  . 

/  0 

(*  l)-2 

a-2 

O-J  •  . 

/ 1 

(AiU 

o-a 

•  • 

/  2 

• 

• 

•  • 

• 

• 

• 

#  *  . 

• 

• 

• 

*  P\  . 

• 

(^2)2 

• 

01  0o  . 

• 

(*2)1 

** 

0i 

00  0-1  . 

• 

(A  2)0 

0-1  0-2  . 

• 

(*2)-i 

0-i 

0-2  0-S  . 

( A  2)— 2 

0-2 

0-a  . 

• 

(*2)-a 

•  •  . 

• 

• 

• 

•  •  ...... 

• 

• 

• 

•  • 

« 

• 

The  next  step  is  to  show  that  there  exists  an  optimal  Q  £  H ^  such  that  the  equality  in  Eq.  3-17 
bolds;  for  which 
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M 

A 

■ 

^ . 

1 

_ J 

111*11 


GL|  0-2 

a.2  <*-*  • 

•  • 

•  •  • 

♦  •  • 

•  •  P\ 

•  fit  0o 

0\  0o  0-\ 

00  0-1  0-2 
0-1  0-2  0-0 
0-2  0-0  • 

0-0  •  • 

•  •  • 

•  •  • 


From  Theorem  3.1.5  (Parrott's  theorem),  there  exists  a  Q  0  such  that 


lir*l! 


a0~  Qo  a-\  a-2 

a-l  a-2  a-0 

a-t  a-o  • 

a_j  •  • 

•  •  • 

•  •  • 

•  •  0i 

•  0\  0o 

0i  0o  0-i 

00  0-1  0-2 

0-1  0-2  0-0 

0-2  0-0  • 

0-0  •  • 

•  •  • 

•  •  • 


Next,  choose  such  that 


01 


<*1-  Qi 

ao~  Qo 

<*-l 

<*o~  Q  0 

a-i 

0.2 

<*-1 

a- 2 

®-4 

<*-2 

<*-» 

• 

t»-s 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

4i 

• 

fit 

4o 

fit 

A) 

4-, 

A> 

4-i 

fi-2 

4-1 

0-2 

4-4 

0-2 

4-4 

• 

4-4 

• 

• 

• 

• 

• 

• 

• 

• 

Continuing  in  this  way  to  find  {Q0 ,  Q  lf  Qt,  . } 

R 1  -  Qopt 

r  =* 

n  2  oo 


such  that 

where  Q,„(r) 


S  <?.*’  6  //» 


■  — o 


This  concludes  the  proof  of  equality. 


QED 


The  minimal  norm  7,  can  also  be  expressed  in  terms  of  the  following  eigenvalue  problem  ([V4]). 


Corollary  1 

l|r*ll  -  (3-19) 

where  is  the  Hanktl  matrix  generated  by  R 1  and  is  the  Toeplitz  matrix  generated  by  R*R2 


03 


a~ i  a-2  O-* 
0-2  «-»  • 


•  •  • 

•  •  • 

•  •  01 

•  01  0o 

01  00  0 

00  0  0 

0  0* 


(3-21) 


because  {(j, }  are  chosen  to  be  norm-preserving.  The  zero  rows  in  Eq.  3-21  can  then  be  deleted  without 


affecting  the  norm. 


•  •  1. 


+  Trf.Trf,] 


Since  R  2  6  H by  Lemmas  3.3.2  and  3.3.4-(ii), 


Tffa*T/jJ  —  —  Th2.«j  “ 


This  completes  the  proof. 


Remark 


Corollary  2  can  also  be  proved  using  Lemma  3.3.4-(i)  (Sarason  [S3]).  Recognizing  that  Eq.  3-18  is 


equivalent  to 


X~.  +  Tb.'Tr.  + 


The  result  follows  immediately  from  Lemma  3.3. 4-(i),  since 


The  following  theorem  is  the  real-rational  version  of  Theorem  1.  The  theorem  is  stated  without 
proof  which  will  be  given  in  Chapter  5  where  real-rational  matrices  are  the  focused  topic. 

Theorem  3 

If  R  €  RLao  (real-rational)  in  (2gdp),  then  there  exists  an  optimal  Q  €  RH <*,  ■ 

Although  Theorem  3.1  and  Corollary  3.2  give  explicit  formulas  for  7, ,  the  formulas  cannot  be  used 
directly  to  compute  either  7,  or  optimal  Q  s.  Certain  observations  can  be  made  from  general  operator 
theoretic  considerations  which  indicate  the  difficulty  of  such  computations.  A  Hermitian  Toeplitz  opera¬ 
tor  has  no  point  spectrum  (i.e.,  no  eigenvalues).  This  is  known  as  the  Hartman-Winter  theorem  (see 
Douglas  (|D9|)).  Therefore,  in  Eq.  3-4,  has  infinite  rank  even  when  R2  i»  rational.  This  is  quite 

different  from  the  simple  best-approximation  problem  (e  g.  Adamjan,  Arov,  and  Krein  (|Al],[A2)),  and 
Glover  ([G2])),  where  in  the  real-rational  case  the  corresponding  Hankel  matrix  has  only  finite  rank  which 
is  equal  to  the  McMillan  degree  of  the  given  transfer  matrix.  While  neither  Theorem  3.1  nor  Corollary  3.2 
provide  a  numerically  attractive  method  for  computating  7, ,  which  remains  an  open  question,  existence  is 
settled,  and  the  Hankel  ©  Totplilz  structure  appearing  in  Corollary  3.2  can  be  analyzed  to  provide 
further  insight  into  the  problem  of  computing  7, . 

Since  for  rational  R  the  Hankel  part  of  the  operator  in  (3-4)  is  finite  rank  and  therefore  compact, 
the  operator  H» 'H*  T_._  can  be  viewed  as  a  compact  perturbation  H/»  *Ho  of  the  operator 

^RfRj  ^  standard  result  in  operator  theory  is  that  compact  perturbations  do  not  change  the  continuous 

part  of  the  spectrum  of  an  operator  (Gohberg  and  Krein,  [G4j).  It  is  easily  shown  from  these  general 
operator  theoretic  considerations  that  the  spectrum  of  Hr*H ^  consists  of  the  continuous  spec¬ 
trum  of  plus  a  finite  number  of  discrete  elements.  The  spectral  radius  is  then  either  equal  to  that 

of  T„,_  and  can  be  computed  from  pXfl  FT.  _  1  —  ||  /fjlloo  or  *s  achieved  on  the  discrete  part  of 


v'r*  '  . 


\r  - 


the  spectrum.  In  the  latter  case,  there  is  no  known  method  for  directly  obtaining  7, ;  this  has  motivated 
the  investigation  involving  approximations  and  bounds  that  appear  later  in  this  section  as  well  as  the 
iterative  scheme  in  the  next  section. 

Operators  with  the  Hankel  QToeptitz  structure  of  Corollary  3.2  have  been  investigated  in  some 
detail  by  Jonckheere  and  co-workers,  both  in  the  //*  context  (|V4|)  and  earlier  in  problems  arising  in 
LQG  (eg.  [Jl|).  Recently,  Jonkheere  and  Verma  ([ J2|)  have  proposed  a  scheme  for  estimating  7,  by 
computing  the  solution  to  a  single  Riccati  equation;  it  is  hoped  that  this  will  ultimately  lead  to  more  com¬ 
putationally  attractive  methods  than  are  currently  available. 

To  avoid  these  difficulties  which  arise  in  the  direct  approach,  an  iterative  scheme,  called  a  7- 
iteration,  will  be  proposed  in  Chapter  4.  The  rest  of  this  section  is  devoted  to  the  generalization  of 
Theorem  1  to  the  4-block  GDP. 


For  the  4-block  GDP,  i.e., 


R  n  -  Q  R 12 
R  a  ^  22 


(4gdp) 


let’s  define  the  operator  T R  as  follows; 


T*  :  H2QL3  —  ©/,, 


0  f*.i  r/ 


0  /  l/?a  R~i  ; 


Theorem  4 


Consider  the  4-block  GDP  and  assume 


R 11  R 1 
R  31  Ri 


00 


min 

«£Jf. 


R  12 
R  22 


I® 


lir.ll 


Proof 

The  proof  is  based  od  the  Parrott/Davis-Kahan-Weinberger  theorem,  and  follows  closely  the  proof 
of  Theorem  1. 


First  it  is  useful  to  establish  that 


min 


R  22  “  Q  R  12 

R  21  R  22 

»  « 


>  111*11 


(3-22) 


V  Q  €  Woo  - 

|Wn  -  Q  R 12 


21  n  22 


■up 


>  sup 


(Rn-Q)  f  +  R is  9 
R  21  f  +  R  22  9 


:  f  6  Lt 


*■! 


R  21  f  +  R  22  9 


'■  /  €  W2,  9  6  f<2> 


<  1 


■up 


Pj|i(Wu  /  +  R 12  9  ) 

R  21  f  +  R  22  9 


:  /  6  Wj,  t  €  L2, 


<  1 


-  IIMI- 


The  matrix  representation  of  the  operator  Tr  is  derived  as  follows.  Since  R  £  Lx,  its  four  subma¬ 
trices  can  be  written  as  power  series  expansions: 

00  00 

E  E 

*  •  -  oo  k  •  -  oo 

00  00 

E  /»***  E  ft** 

*•-00  *  •  -  00 
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The  next  step  is  to  show  that  there  exists  an  optimal  Q  6  H ^  such  that  the  minimum  is  achieved. 
Therefore 


7.  -  l|r*  || 


O-i  a-i 
0-2 


'll  Vo  V-i  V-2 
Vo  V-i  V-2 


0\  00 
0o  0.x 
0-X  0-2 


0-2 


Cl  fo 
Ci  <b  C-i 

Ci  Co  C-i  C-2 

S»  C-l  C-2  • 


•  •  • 


•  •  •  • 


(3-23) 


Applying  the  norm-preserving  dilation  repeatedly,  {(Jo,  Q  w  Q  j,  . }  can  be  found  such  that 


A  11  “  Q'ft  A 12 

A  21  A  22 


where  Q,„(z)* 

i  -»o 


This  concludes  the  proof  of  equality. 


QED 


Identifying  the  structure  in  Eq.  3-23  with  the  operators  defined  in  Section  3.2  and  3.3,  the  following 
corollary  follows  immediately. 


Corollary  8 


(3-24) 


Similar  to  2-block  case  (Theorem  3),  the  following  theorem  will  be  proved  later  in  Chapter  5. 


00 


Thaortm  6 

If  R  €  RL^T)  (real-rational)  in  (4gdp),  then  there  exists  an  optimal  Q  €  f?//®. 

Although  the  existence  of  the  optimal  solution  is  also  justified  for  the  4-block  case,  Theorem  3 
suffers  the  same  criticism  as  in  the  2-block  case  of  being  nonconstructive.  The  associated  eigenvalue  prob¬ 
lem  (Corollary  5)  is  much  worse  than  that  in  Corollary  2.  An  alternative  approach  must  be  sought.  This 
will  be  the  subject  of  the  next  chapter. 


4.1  7- Iteration  and  Bounds 


la  this  section,  we  present  an  iterative  scheme  suggested  in  Doyle  (1983,1984)  to  solve  the  general 
distance  problem.  The  idea  is  that  by  guessing  a  value  for  the  minimal  norm,  7, ,  the  distance  problem 
can  be  simplified  to  an  equivalent  best  approximation  problem  which  can  be  solved  by  existing  algorithms 
(for  example,  Glover  (1984)).  This  guess  can  be  iterated  to  obtain  convergence  to  the  optimal  norm  and 
optimal  Q .  This  7-iteration  procedure  was  suggested  independently  by  Francis  (1983)  (Chapter  8)  for  th» 
2-block  GDP  and  by  Verma  and  Jonckheere  (1984)  for  the  so-called  "SISO  mixed  sensitivity  problem,”  a 
2-block  case. 

Again,  the  2-block  and  4-block  cases  will  be  considered  separately.  Theorems  4.1  and  4.2  lie  at  the 
heart  of  the  7-iteration  scheme.  In  what  follows,  M  will  be  called  a  spectral  factor  of  a  rational  para- 
Hermitian  5  if  M  6  RH  &  i®  a  unit  (A/~l  6E  RHX),  and  M*M  =S. 

Theorem  4.1 

Assume  Q  6  and  7  >  1 1  /?  2  |  |oo-  Then 

[/?!-<? 

R a 

if  and  only  if 

||  (Rl-Q)M-1  |U  <  1  (4-2) 

where  Af  is  a  spectral  factor  of  the  para-Hermitian  matrix  (7*/  -  R  2*/?  2)- 

Proof 

Note  that 

(*,-<?)•(/?,-$)<  ri  -RfRt 

evaluated  on  the  imaginary  axis  is  equivalent  to  both  (4-1)  and  (4-2). 


<  7  (4-1) 


The  theorem  means  that  if  7  >  1 1  /? 2  I  loo>  there  exists  a  Q  6  //®  ®uch  that  |  |/f  _1 

and  hence,  ||HRjV-i||  <  1.  For  such  a  Q,  Q  M  will  satisfy  Eq.  4*1.  Solutions  to  Eq 

obtained  by  considering  the  following  best  approximation  problem 

fh)  -  Jrtn~\l*iArl-Q\L  - 

Therefore,  the  general  distance  problem  can  be  reduced  to  a  best  approximation  problem.  In  the  case  that 
the  function  RtM~l  is  real-rational,  the  algorithms  in  Glover  (1984)  can  then  be  applied  to  solve  for  the 
optimal  Q  6  f?/f®  corresponding  to  the  given  7. 

The  case  where  7®*  ||f?j  ||®  must  be  handled  slightly  differently  because  the  factorization 
required  in  Theorem  4.1  would  yield  an  M  with  zeros  on  the  axis  and  thus  not  a  unit.  A  necessary 
condition  for  4-2  to  hold  is  that  R  tAf  "l  6  L0 0  even  though  A/'1  is  not  in  L®.  While  this  is  possible 
conceptually  it  is  a  condition  that  can  never  be  verified  numerically  and  we  must  settle  in  practice  for  the 
conditions  of  Theorem  4.1. 

Using  Theorem  4.1,  we  can  view  7  in  4-3  as  a  function  of  7  and  iterate  on  the  choice  of  7  until 
7  **  1.  The  following  procedure  is  a  general  description  of  the  7-iteration  procedure  for  the  2-block  prob¬ 
lem. 

(i)  Compute  the  lower  bound  ||/?s|!®. 

(ii)  Choose  7  such  that  ||f?2||®  <  7. 

(iii)  Find  a  spectral  factor  M  ■»  (7*/  -  R£Ro)l/i. 

(iv)  Let  7=||H/?iW-,||. 

(a)  if  7>1,  go  to  (ii)  and  choose  a  larger  7. 

(b)  if  7<  1,  go  to  (ii)  and  choose  a  smaller  7. 


-Q  II®  <  1 

.  4-2  can  be 


(4-3) 
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(c)  if  7*1  go  to  (v). 

(v)  The  value  of  7  is  the  minimal  achievable  norm.  Find  a  best  approximation  of  RlM'1,  denoted  by 

Q.- 

(vi)  An  optimal  solution  Q,fl  *  Q  tp,  M . 

A  numerical  implementation  of  this  procedure  could  not,  of  course,  evaluate  7  exactly,  so  the  test  in 
(iv-c)  would  never  be  met  exactly.  This  procedure  could  be  used  to  Bnd  a  Q  that  yields  a  solution  arbi¬ 
trarily  close  to  7, .  In  order  to  do  this,  some  scheme  for  selecting  the  next  guess  for  7  in  step  (iv)  is 
needed  and  the  convergence  properties  of  the  procedure  would  depend  on  this  scheme.  The  next  section 
will  focus  on  the  properties  of  7  as  a  function  of  7  that  make  it  possible  to  converge  rapidly  to  7 ,  . 

Since  the  approach  proposed  is  an  iterative  one,  it  will  be  helpful  if  the  upper  and  lower  bounds  can 
be  provided  in  advance.  The  bounds  in  Theorems  2  and  4  are  quite  useful  since  they  are  relatively  easily, 
computed  and  the  upper  and  lower  bounds  have  a  ratio  no  greater  than  >/2  in  the  2-block  case  (Theorem 
2)  and  2  in  the  4-block  case  (Theorem  4). 


t 


Theorem  2 

Let 


rx-q] 

70  =*  min 

<?  e 

R 2  J 

11**11. 


7/ ,  =*  max  |  l|Hfl  ||  ,  ll^zlU  J 

7/,  =  max  |  ||H*J|  ,  ||f?2||*  J 


Then 


72 


7,  <  7/, 


QED 

The  following  two  theorems  are  the  generalizations  of  Theorems  1  and  2  to  the  4>block  GDP 


Theorem  3 


R  n  -  Q  R  it 

R  21  R  22 


(7  >  7/j) 


if  and  only  if 


(/  ~-Q  }(/-£•£)■ 

1  7  7  7  J 


Va 


<  1 


where 


(/  -IL*)W 
(/  -  L  *L  )w 
L 
L 
S 
S 


spectral  factor  of  (/  -LL  *) 
spectral  factor  of  (/  -L  *L  ) 


RaS~' 


S’lR  j, 

(t2/  -R&RBy* 


and 


F  ,[Lr,1.R&)  -  I  ( /?  n  +/f,2(72/  -R&Ra)-xR&Rn\ 
7  7  7  1  1 


Proof 


From  Theorem  1, 


T  satisfies  T*T  —  (~rl  -  ,  J  |^a  Ra J)  and  can  be  chosen,  from  Lemma  A.3, 


'  7(7  -  L*L 

r(s-'YRi 


•L)in  0 
R  &R  21  ^ 


[fin- <3  fi«]  r-* 


-(/ 

[fi  n  -  <3  fi  12]  j  .  . 

L  J  -(72/  -R&R^Y'R&RiAl  -L*L)'W  5-1 


=  Y  j  (fill  +fiu(l2/ -fi^fiar'fiiSfin)  -<3  }  (7-L*L)-1/s  fil2S-‘ 


Recognizing  that 


7  (fin +fiizh27 -fiiSfia^fi^fin)  -  F,(-fi  ,-fi-S) 

7  7  7 


which  is  a  linear  fractional  transformation,  one  has  that 


k 


L^L)-*  L 


Using  the  dual  form  of  Theorem  1,  the  inequality  4-9  is  equivalent  to 


(/  -U*)-w  (  Ft(  —  R  ,— fija  )  -  — G  ]  (/  -£*£)' 

l  7  7  7  ) 


Remarks 


(i)  This  theorem  is  a  generalization  of  Corollary  3.1.7  which  was  proved  in  [D l] .  The  difference  is 
that  the  causality  is  a  constraint  in  the  problem  considered  b  \e 


(ii)  5  and  5  need  not  to  be  spectral  factors.  5  and  5  can  be  any  square  transfer  function  matrices 
such  that  5*5  -  (t2/  -  fi^fi-)  and  5  5*  -  (r!  -  R  ~R  &  ). 


(iii)  The  linear  fractional  transformation  F((  —  R  ,  —  R&)  has  no  poles  on  the  ;  w«axis.  This  is  true 

7  7 

because/?  €  /-»  and  7  >  HfialU- 


The  next  theorem  provides  the  upper  and  lower  bounds  for  the  7-iteration  of  4-block  GDP. 


Theorem  4 


ft  11  -  Q  ft  12 
fta  ft  22  _  oo 

IlHjrJI  1 

ll^islloo  p 

I II« ax  Gallic.  2 


I  |Hff  u|  I 

I  l^2ll  loo 


IIHaJI  IIRuIIo 

ll*»ll.  \\ft72\U 


max  {  ||H„||,  ||[i?a/?sJ|U,  ^  j 


max  H 


I |H[/ru *,3,1 1  •  lll*a*dl 


ll® 


|Hj»u||.||tfalL.  \\ft12W00.  ||*a||„  } 


4  7.  <  7 it  <  7i j  <  7<,  <  7,  <  7»,  <  7«t  <2  7. 


Proof 


v*  N  V\%-  %  %  V*  .  .*•  .N  I 


Let  Q„I .  Q  ,  Q  ,pt  satisfy 


*  u  -  Qopt  * 12 

*  21  R  22 

III* n -  <?  »ri  *  «j 


—  min 

®  96«, 


R  n~  Q  R 12 
*21  /?22 


I®  -  ^  III*n  -  Q  R 12I 


and 


^11  -  Q  „t  lice  -  -min  II  Rn  -  Q 

v  t  n 


The  proof  is  broken  into  two  parts: 


(A)  Upper  bounds  (7,  <  *U  ,  <  7ua  <  2  7,  ). 


To  establish  that  7,  <  7.  ,  note  that 


R  11  ~  *  12 

*21  /?  22 


1  r*  a  -  q  ip!  r  12 

I  [  R  21  R  23 

I  r  i  m*  u  -  .fi  r  121 

II  III*a*«lll® 


R  n  -  Q  >pt 

1 1*21 1  loo 

R  21 
R  22 


7,  <  7«,  • 


That  7,^  <  7Uj  is  obvious. 


To  demonstrate  that  7,  <  2  70  .  note  that 


i*u- <?,„  Hoc  11*  is  1 

1 1  *  21 1 1 00  ||*2:| 


II*  11  -  Q>pt  I  loo  ||o  12 1 ! jo 


1*21  I 


II* 

1 1*  22!  loo 


(from  Lemma  A.l) 


(from  Theorem  2) 


00 


(from  Ltmm  A. 2) 


<  2 


21  22 


7«.  <  27.  . 


(B)  Lower  bounds  (  -  7,  <  7/,  <  7/,  <  7<t  <  7.  )  . 


To  show  that  7,  <  7,  note  that 


IIHhII  -  min  ||tf-$||„  < 

V  6  /*<*> 


/?  - 


Q.ji  0 

0  0 


In  addition,  it  is  easy  to  see  that 


lilft2i  #22]!!®  <  7,  and 


R 12 
R~> 


To  show  that  7 (j  <  7,  note  that  both  (i?2i  and 
the  inequality  follows  immediately. 


R 


12 


R, 


<  7.  - 


are  the  "compression”  of  R 


Since  R  n  is  a  "compression"  of  [R  n  i?l2j  and 


Rn 
R  2j 


,  and 


max 


/?  12 
f?22 


ll*«IU  ||*ulU  llAslloo}  <  max  j  |  ||H 2,^22! I  too. 

%  <  7/j  • 

Finally,  to  demonstrate  that  y  7,  <  7^  note  that 

7.  <  7.,  <  2max[  !  |H* u|  |,  1 1* „ |  L,  | \R u\ \m,  |  a|  U  j  -  2  7,, 


i’.  ^  ii,. 


,  hence, 


QED 


Remarks 


(i)  Note  that  |  [/?  |  )<„ 


is  also  an  upper  bound. 


flu  A 12 
R  21  R22 

(ii)  The  proof  of  Theorem  4  implies  that  the  7-iteration  may  be  avoided  altogether  by  simply  using 
Q  ,pl  instead  of  Q„pl ,  with  the  guarantee  that  the  solution  will  be  suboptimal  by  no  more  than  a 
factor  of  2.  Note  that  Q  ,p(  is  simply  the  best  approximation  to  Ru.  For  the  2-block  case  this 
approximation  method  will  be  suboptimal  by  no  more  than  \/2. 

This  section  will  be  ended  with  a  description  for  the  7-iteration  procedure.  Both  2-block  and  4- 
block  cases  will  be  discussed. 


I.  2-block  c»*e  (  7-lteratlon  procedure)  i 

(i)  Compute  the  lower  bound  HA2II00' 

(ii)  Choose  7  such  that  ||fl2||ao  <  7. 

(iii)  Find  a  spectral  factor  M  =»  (7 2 1  -  R  2*fl2)l/5 

(iv)  Let/ (7)-  HH^.,11. 

(a)  if  /  (7)  >  1,  go  to  (ii)  and  choose  a  larger  7. 

(b)  if  /  (7)  <  1  and  7  >  |  |fl2|  I*,  go  to  (ii)  and  choose  a  smaller  7. 

(c)  if  /  (7)  =  1,  go  to  (v). 

(v)  The  value  of  7  is  the  minimal  achievable  norm.  Find  a  best  approximation  of  RiM~l,  denoted  by 
Q  apt  ■ 

(v i )  An  optimal  solution  is  Q,pl  =  Q  ,pl  M . 


(i)  Compute  the  lower  bound  7, 


max 


111^21  #22!  I  la 


R 12 

R  22 


(ii) 

(in) 

(iv) 

(v) 


Choose  7  such  that  7  >  7, . 

Find  the  spectral  factor  5  =—  (7 2I  -R&R a)1*  and  S  —  (7 21  -R^Ra)^-  ^et  £  “  fli2S~' 
and  L  >-  S'lR  2i- 

Find  the  spectral  factor  (/  -LL  *)w  and  (/  -L  *L  )w. 


Let  /  (7) 


H 


(/  -LL  •)- **F,  (~R  ,-K£  V  -L  •£ )  -  V» 
7  7 


where  F,  (—/?,—/?  22 )  is  a  linear  frac- 
7  7 


tional  transformation, 

(a)  if  /  (7)  >  1,  go  to  (ii)  and  choose  a  larger  7. 

(b)  if  /  (7)  <  1,  go  to  (ii)  and  choose  a  smaller  7. 

(c)  if  /  (7)  —  1,  go  to  (vi). 

(vi)  The  value  of  7  is  the  minimal  achievable  norm.  Find  a  best  approximation  of 

(/-LL*)-,/7F,(-R  ,-R^)(/-LtL)-^,  denoted  by  Q  . 

7  7 

(vii)  An  optimal  solution  is  Q,pt  =>  7 (/-LL*)w<j  ,pt  (I-L  *L  )w  . 

The  algorithms  are  not  complete  without  some  method  for  selecting  the  next  guess  for  7  in  step  (iv) 


(2-block)  or  step  (v)  (4-block).  The  guaranteed  convergence  rate  for  the  algorithm  will  depend  on  this 
method  and  what  can  be  proven  about  the  relationship  between  7  and  /  (7).  This  relationship  is  the 
focus  of  the  next  section.  The  justification  of  the  above  algorithms  can  also  be  seen  there. 
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4.2  Properties  of  ^-iteration 

This  section  develops  some  properties  of  the  7-iteration  in  the  2-block  GDP  that  are  useful  in 
selecting  the  next  guess  for  7  in  step  (iv)  of  the  algorithm  in  the  previous  section.  The  guaranteed  conver¬ 
gence  rate  for  the  algorithm  will  depend  on  this  method  and  what  can  be  proven  about  the  relationship 
between  7  and  7. 

Recall  from  Theorem  1  in  the  last  section  that  for  a  given  7,  7  can  be  found  as  the  norm  of  the 

Hankel  operator  v_,.  Since  R  j  and  M  are  rational,  this  operator  is  finite  rank  and  7  can  be  found  as 

the  square  root  of  the  largest  eigenvalue  of  the  following  standard  eigenvalue  problem: 

=  x“  (SEP) 

or  equivalently  as  the  square  root  of  the  maximum  eigenvalue  of  the  following  generalized  eigenvalue 

problem: 

HSl*HRiu  =  M72/  -  v  (GEP) 

To  see  this  equivalence  note  that  since  Af~l  6  //»  .  w_,  is  equal  to  HrTu-i.  Hence, 

(SEP)  **  -  Xu 

=*  XTw*T^u 

=  XT^.T  uv 

=»  XTtf<A fv  (Sarason  (1967)) 

— ■  H.l*Hnlu  -  x<T-j, 

—  -  X(T*/-T,„tl.  (GEP) 

This  is  the  desired  generalized  eigenvalue  problem  (GEP)  and  the  dependency  of  the  (generalized) 
eigenvalues  on  7  is  clear.  Because  X  and  v  in  (GEP)  can  be  obtained  from  the  standard  eigenvalue  prob¬ 
lem  in  (SEP),  existence  of  eigenvalues  or  eigenvectors  for  (GEP)  is  assured.  The  eigenvalues  of  (GEP)  are 
functions  of  7  and  are  non  negative  for  all  7>  1 1  TRjR  J  |  W=*  1 |/?2|  I*.  We  shall  prove  that  XmM(7)  (and  its 


square  root)  is  continuous,  strictly  monotonically  decreasing  and  convex  in  7  where,  for  3  given 
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-7>  1 1 2 1  loo.  *n»*(7)  *®  defined  m  the  maximum  eigenvalue  of  (GEP).  Result*  stated  as  Theorem  7  show 
that  caD  be  bounded  by  simple  functions.  These  properties  can  be  used  to  guess  successive  7's  in 

a  way  that  should  quickly  converge  to  the  optimal  norm,  7, .  Theorems  8  and  9  give  the  conditions  for 
which  the  minimal  norm  is  achieved. 

A  key  observation  of  (GEP)  is  that  it  can  be  regarded  as  a  "perturbed"  generalized  eigenvalue  prob¬ 
lem.  Therefore,  the  perturbation  theory  of  generalized  eigenvalue  problems  for  a  special  case  which  can 
later  be  applied  to  (GEP)  will  be  considered  first.  The  results  can  then  be  used  to  prove  the  properties 
mentioned  above. 

Consider  the  following  generalized  eigenvalue  problem  for  Hilbert  space  operators  A  and  B , 

Av{t)  -  X(l)0(t)v(<)  l  €  (-s.e)  ,  t>0  (GEPl) 

where 

A  is  finite  rank,  positive  semi-definite  and  independent  of  t , 

and 

B(t )  is  bounded,  positive-definite  and  analytic  in  the  neighborhood  of  t  =0. 

Since  A  and  B(l )  are  self-adjoint  and  B(t  )>0,  it  is  well-known  (Kato  (1976))  that  by  appropriate  ord¬ 
ering  of  the  eigenvalues  {X,  }  and  selection  of  eigenvectors  {u,},  it  is  possible  to  pair  eigenvalues  and 
eigenvectors  {X,  ( t  ),v,  (t )}  such  that 

Av,(i )  —  X,(l  )B(t  )t>,  (/ ) 

for  all  t,  i  and  {X,  (<  )},{t>,  (f )}  are  analytic  for  all  t  €  (-£,£).  At  values  of  I  where  (GEPl)  has  simple 
eigenvalues,  this  is  trivial.  At  degenerate  points,  it  requires  the  selection  of  X,(f),  c,  (f )  such  that  the 
analyticity  is  retained  through  (isolated)  point  where  eigenvalues  coalesce.  Therefore,  X ( <  ),  u(i)  and 
B(l )  can  be  written  in  Taylor  series  expansion  as  follows: 
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.  (*.. 

x ,  +  t  x,+  — x,  +  .... 

(X.  -  X(0),  X.-  X(0),  ...) 

.  t  2  . . 

V,  +  tv,  +  —V,  +  . 

[v,  «*  v  (0),  v,=  ii  (0),  ...) 

and 


M  0 
»(0 


-  B.  +  tB,+  ±ti,  +  .  (B,  -  B(0),6.=  B(0),  ...) 


Substitution  of  these  expressions  into  (GEPlj  yields 


A(v,  +  tv,+  —v,  +  ...) 


(4-10») 


=  (X,  +  ...)(£,  +  tB,+  ^B,  +  ...)(ut  +tva+^-v,  +  ...) 


(4-11) 


<2  ..  ... 

Au,  +  tAv ,  +  —Av,  +  ...  (X,B,  v, )  +  t(B,  +  \J3,v,  +  \,Bjv,  +  X„fi,  uj 

(2  ..  ..  .... 

+  ~(X,  B,  v,  +  \,B,  v,  +\,B,v ,  +2\$<?), 

+  2\,  Bju,  +  2\p,  vj  + .  (4-12) 

Eq.  4-12  is  true  V  t  €  (-«,«)■  This  implies 

(i)  t°  (constant)  term: 

Av,  =»  X,  B,  v,  (4-13a) 

(ii)  tl  term: 

Av,  —  X£,  u,  +\,Bj>,  +  \,B,v,  (4-13b) 

(iii)  t 2  term: 

Av,  ■»  X,  B,  v,  +  X,  B,  v,  +\,D,v,  +  2(X<£„j/,  +  X ,  B,v,  +  X  fi,  vj  (4-13c) 

For  simplicity,  X,  has  multiplicity  1  and  I  =0  is  assumed  to  be  a  regular  point,  i.e.,  multiplicity  of  X(( )  is 
constant  V  <  6  (-e,e). 

Note  that  (u,  (<  )}  form  a  complete  set  of  eigenvectors.  Hence,  the  generalized  eigenvalue  problem 
considered  here  can  also  be  expressed  as  :  find  A(< )  such  that 


AV(t)  -  0(t)P(f)A(f)  (for  some  V(<)  jt  0) 


(GEP2) 


where 


HO  *  [y(0  n  (<  )j  and  A(f ) 


\(t )  0 

0  A,(< ) 


Lemma  1 

Assume  Vj_  *  4^  (0)  and  v,  and  B,  as  defined  in  Eqs.  4-10b  and  4-10c  ,  then 

v*B.  Vl  -  0  . 

Proof 

Consider  (GEP2)  at  f  **  0, 

AV,  =-  B,  K.A. 

«>  [(B,-VtYAB,-Vg[(B:nV,)  _  A,  (4- 14) 

where  B,*'*  is  any  nonsingular  matrix  such  that  (B,1’*),B,W  =  B, . 

Finding  A,  to  satisfy  Eq  4-14  is  a  standard  eigenvalue  problem  where  (B,  ~ln)*A  (B, )'  w  is  Hermi- 
tian  (in  fact,  nonnegative).  It  is  well-known  that  the  eigenvectors  of  a  Hermitian  matrix,  if  they 
correspond  to  different  eigenvalues,  are  orthogonal  to  one  another.  Therefore, 

(B.%,  )*B,xnVl  =  v*B,  VL  =  0  . 

QED 


The  following  theorem  gives  the  formula  for  the  first  and  second  derivative  of  the  eigenvalue  of 
(GEPl)  X(l)  at  t  -  0. 


Theorem  2 


Consider  the  (GEPl)  with  Taylor  series  expansion  as  given  in  Eos  4-10a,  4-10b,  and  4-10c.  Then 
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....  ••  ,  »,•&,  v,  v,*Bp, 

(11)  X,  -  — — -  +  2X,  ( — — - )* 


« ’,*B,  v, 

2X,2 

v*B,  u, 


* 'SB,  v, 

(Vi*  Bju.  )*((X.  /  - A±)( V  *  B.VJHV;  Bp, ) 


Proof 


(i)  To  get  the  expression  for  X  .consider 


v*  .  (Eq.  4-13b)  =  =  > 

v*Av, 

(Av,  B,  v,  )*v, 
0 


ip.*B,v,  +  \,v*Bp,  +  X,  v,*B,  v, 
^f*B,v,  +  \,v,*Bp, 
it),*B,v,  +  \,v,*Bp, 


X. 


-X. 


v*B0  v, 
V*B,  v. 


(ii)  To  get  the  expression  for  X,  consider 


V*  .  (Eq.  4-13b)  —  > 

Vj*  Av,  -  i,Vf  B,v,  +  X.  Vf  Bp,  +  X,  V*  B,  v, 


From  Lemma  1, 


(AVy  -\,B,vL)*h,  -  X.V7  Bp, 

(B,  FiAi  -X.B,  VL  )*i,  -  X,  V{  Bp, 

(Ai-X,/)(B,  Vx)*i#  -  X.  VA* 

Since  u,  and  the  columns  of  Vx  form  a  basis,  v,  can  be  expressed  as: 

v,  ™  av,  +  V ^  x 

where  a  is  a  scalar  and  z  is  a  vector.  Therefore, 

(B,  VL  )*u.  -  V  *  B,(av,  +  Vl  z) 

=  aVL  *B,  v,  +  Vj*  B,  V1  z 


(4-15) 


(4-16) 


=  (Fx*B.  Ka)z 


(4-17) 


Substitution  of  the  expression  of  Eq.  4-17  into  Eq.  4-15  yields 
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P! 


(A  i-\,I)(Vl*B.VL)x  -  X,  *Bju, 

*  “  X.^-X./KV'i  *5,  VL  )\-1Vl*Bju.  (4-18) 

Also  consider 
v, *  •  (Eq.  4-13c) 

v*Av,  —  X,  v,*B,  v,  +  X,  v,*B,  v,  +\,v,*B,v, 

+  2(\t)*8j>,  +\.v,,Sjj,+  \j>*B,v^ 

(Av,  B,  v,)*v,  ■»  X,  v/B,v,  +  X,  v,*B,  v , 

+  2(iy*6jo,  +  X,  +  \p,*B,  uj 

-  X,  v,*B,  V,  =*  X,  v,*B,  V,  +  2(Xj/,*B,t/,  +  X,  +  X^/B,  vj  (4-19) 

Finally,  substitution  of  the  expressions  of  X,  ti.and  x  into  Eq.  4-19  yields 

X,  K*fl„</,)  =«  -  X,  v,*B,  v,  +  2X,  —  )-. 

t»,*B,  tf, 

+  2X,S(  ^  )*[(X,  /  -  AI)(  VL  *B,  Vl  ))-*(  V,  *BA  )  . 

QED 


Remark 

Note  that  the  assumption  that  l  =  0  is  a  regular  point  does  not  cause  any  problem.  This  is  so 
because  X(t )  is  analytic  and  X(< )  (  X  (t )  )  must  be  continuous  and  X,(X,  )  can  still  be  computed  by  tak¬ 
ing  the  limit  of  X(< )  (  X  (< )  )  as  t  -*0. 

Define  a (f )  =  X  *(< ),  <t,  —  <t(0),  <?,—  <r(0)  and  <j,  **  <r  (0),  then  one  has  the  expression  for  <r(and  a,  , 
which  are  given  in  the  following  corollary. 


1  (  ».  3  ,  l2  2(7*  A 

--  i  -a. -  +  —<r,  ( - r  +  - ( K,*  )• 

2  l  vt*B,  v,  2  v,*B,  v,  vt*B,  v, 

l(\.l-AL  )(V*  B.KjHVf  BA))  • 


For  7  >  HBjIU  define  =  the  maximum  eigenvalue  of  (GEP)  at  a  given  7  and 

=  [X„„(7)|W  • 


The  following  two  theorems  are  the  main  results  of  this  section.  First,  X,^  and  a ^  are  shown  to 
be  strictly  monotonicaliy  decreasing  in  7- 


Theorem  4 


Consider  the  (GEPl)  as  given  above.  Then 


(i)  Xm  is  continuous  in  7. 
(11)  17 *»  continuous  in  7. 


Proof 


Let  t  ■■  7  -  7  and  B(t)—t2l  +  217/  +  (-p/  The  theorem  follows  immediately 


from  the  analyticity  of  eigenvalues  with  respect  to  t . 


Theorem  S 


Consider  the  (GEPl)  as  given  above.  Then 


(i)  Xggg  is  strictly  monotonicaliy  decreasing  in  7. 

(ii)  i*  strictly  monotonicaliy  decreasing  in  7. 


*:’  >v -v  y  v v  >v  ^ 


sv.\  v 


P  root 


Convexity  is  established  next. 

Theorem  ft 

Consider  the  (GEPl)  as  given  above.  Then 

(i)  t'amx  i*  convex  in  7. 

(ii)  Qm..  is  convex  in  7. 

Proof 

The  proofs  will  be  given  in  a  similar  way  to  those  in  Theorems  4  and  5. 


To  establish  (i)  note  that 


«  /V  ^  „ 
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ti,  **  27/  and  B,  —  2/  . 


Since 


( tf,  *Bji.  )2  _ 

-  va*B,  v,  +  2 — —  —  -2ti4*t>,  *  2(2-?)* 


v,*B,  v, 


v.*B,  v,  v,*B,  v, 


■(8?k.%.  -2i>4*04  u4) 


-  ?/  -  r./JlB  <  ?/ 


'pu/t'.  >  v,*B,  v, 
fr-fvSv,  -  -vt*B,  v,  >  0 

If  X4  is  the  maximum  eigenvalue  ,  then  X,  >  0.  It  follows  that  X„..  is  convex  in  i. 


■> 

>> 


To  prove  (ii)  note  that  From  Corollary  3-(ii), 
1 


.  ,  v*B,  y,  3  v*Bj>,  2o?  ^  ^ 

_  —  -  +  -*.(—75 - )2  +  — — - (  V*  B/>,  )• 

-  1  u„*fl4  t>,  -  v*B,  v,  v,*B,  v. 


K Kl-\L  B,  VL  )J'l(Vx*  Bju.)  J 


Since 


V,*B,  v,  3  v*Bju, 


+  —  <7 


v,*v. 


v,*B,  v,  2  ’  v,*B,  v,  v*B,  v. 


(672u,,i»,  -  2v,*B,  v, )  >  0 


If  Xs  is  the  maximum  eigenvalue  at  t —0,  <7,  >  0.  Therefore.  is  convex  in  7. 


QED 


Remark 

The  idea  of  using  generalized  eigenvalue  formulation  (GEP)  in  this  section  is  similar  to  that  in 
Helton’s  broadband  matching  problem  [H2|  although  the  motivation  here  is  completely  different. 


Although  the  function  <7,^,  is  unknown,  the  properties  shown  in  Theorems  4  to  6  have  provided 
some  useful  information  about  <7,^,  that  can  be  used  to  obtain  fast  convergence  of  the  7-iteration.  A 
detailed  study  of  convergence  rates  is  not  given  here,  however  some  guidelines  will  be  presented  which 
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show  how  the  properties  of  tr^.  can  be  used  to  find  a  next  guess  for  7.  One  additional  property  of  <7^  is 
useful  in  this  regard  and  will  be  presented  in  the  next  theorem. 


Define  <r„  (7,1)  —  where  c  —  ‘W'fK'T3  -  ff7  for  some  7  >  0(  —  |j/?2[|ao). 


Theorem  7 

Let  (Tj,,  and  <r„  be  defined  as  above.  Then 

(i)  aomxh)  <  (7,7)  *f  7  <  7- 

(“)  crn>»(7)  —  (7.7)  if  7  —  7- 

(iii)  arm«<7)  >  <\  (7.7)  if  7  >  7- 


Proof 


From  Corollary  3, 


(l2-})s/2 


TT(,n  9% ) 


7*  - 

7 


72  ~  02 


iv,*  v. 


1  ~  TR,r}V> 


where  v,  is  the  (generalized)  eigenvector  corresponding  to 


— (!n 


iv  *v, 


v.*\n  -  TRfR) v. 


Since 


RfR  2 


(T  -  D2)!  <  7 2I  -  T. 


•7\V1VTV\V'  ’.'•«/ 
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i  > 


■  >  vS(l2-f)Iv.  <  v.*h2l  -  T,t„)v, 


v*v. 


(72-^2)  »/(r/ -  r^n. 


•*•  “r~(ln  g„  )  <  -—(In  ^n*,)  <  0 
a  7  a  7 


Since  ^..(-t!  »»  gu  (7.7),  Eq.(4-20)  implies  that 

ln(g„ )  <  ln(<ramx)  if  7  >  7 
and 

ln(gw )  <  >n (aM.)  if  7  <  7 

The  theorem  follows  immediately. 


(4-20) 


QED 


The  importance  of  Theorem  7  can  be  seen  from  Figure  4-1.  Suppose  that  at  one  step  in  the  7- 
iteration,  we  have  evaluated  g^  at  7,  and  7,  from  previous  iterations,  and  want  to  make  a  new  guess 
for  7.  Assume  that  /3  <  7i  <  7«  such  that  g—.lir/ )  >  1  and  gnm*(7«  )  <  1.  From  Theorem  7,  we  know 
immediately  that  7,  <7,  <  7«  •  Since  (7m  is  a  convex  function  in  7,  g,,^,  must  lie  below  the  line  seg¬ 
ment  (denoted  by  Fj(7))  connecting  the  points  (71  ,<7n»x(7i ))  and  (7»  >gna»(7» ))  addition,  by  Theorem 

„  L  .L  ,  4.  ,  »  <7nm*(7f)(7 -  P)'*  L 

7,  ffai  w>>*  l*e  above  the  function  g„  (7,  ,7)  ™  - —5 — -5-J-5 -  when  7  >  7, . 

{7  -  r) n 

Suppose  that  7U '  and  7/  are  the  points  where  F\(lu'  )  *>  1  and  gu(7(,7,'  )  =  1.  We  can  con¬ 
clude  immediately  that  7,'  <  7,  <  7« ’  .  The  next  guess  for  7  is  narrowed  considerably  over  what 
would  be  known  on  the  basis  of  continuity,  convexity,  and  monotonicity  alone.  Thus  it  is  clearly  possible 
to  obtain  a  scheme  for  picking  the  next  guess  for  7  that  will  provide  rapid  convergence  to  the  optimal 
Further  consideration  of  convergence  rates  is  beyond  the  scope  of  this  report. 


Remark 


The  ^-iteration  can  be  viewed  as  the  problem  of  finding  the  zero  crossing  of  the  function 


(<rBli*(7)  -  !)• 


Figure  4-1.  7-iteration 


The  following  two  theorems  give  the  conditions  for  which  the  optimal  norm  7, ,  is  achieved. 
Define  F( 7)  -  R  t(i2/  -  RtR2Y'n. 

Theorem  8 


Assume  7  >  ||/?2||oc.  Then 


(Necessity ): 


If  UH^-jlJ  »  1  but  7  jL  7, ,  then  7  must  be  greater  than  7,  (since  7,  is  the  minimum  norm) 
From  Theorem  5,  ||H/.(-)||  is  strictly  monotonically  decreasing  with  respect  to  7.  Therefore, 

7  >7.  —  >  ||H,(,#)||  >  ||H,(-,||  -  1 

This  contradicts  the  definition  of  7, .  Hence,  7  ™  7, . 

(Sufficiency): 

If  7  «  7.  but  <  1,  by  continuity  and  monotonicity,  3  i,  >  0  such  that 

l|Hf(--,)I|  <  1  (where  7  —  7-«.) 

This  is  impossible  since  7.  is  the  minimal  achievable  norm. 

“  1  • 

QED 


Theorem  0 

Assume  7  *■  1 2(  loo.  Then  min  I  IF (7)  -  Q  1 1  <  I  if  and  only  if  7  —  7, . 

0  e  mrz 


Proof 

(Necessity)  : 

<  l  implies  7  >  7, .  Together  with  the  lower  bound  (Theorem  4.1.  2)  7t  >  1 1«  all., 
therefore,  7,  *=  7. 

(Sufficiency)  : 

This  is  obvious  from  the  Theorem  4.1.1. 


Il^lloo  and  |  [  <  1  if  and  sly  if  7  —  yt . 


4.3  An  Example 


lo  this  section,  a  simple  example  with  a  single  parameter  is  constructed  to  illustrate  various  proper¬ 
ties  in  the  7-iteration.  An  exact  optimal  solution  will  be  derived  in  detail. 


Consider  the  following  2-block  problem: 


7. 


min 

<3  e  bh. 


1 


t-l 

1 

i  -a 


-  Q 


(a  >  0) 


(4-21) 


oo 


Let  Qt  be  the  optimal  solution  which  achieves  the  minimum  norm.  Using  the  formula  in  Section  3, 

lr  -  vl  II 

<  7 


9  -1 

1 

9  -a 


(-V  -  Q)M-1  II  <  1 

8  — 1  oo 


#-l  _  iJ2.  1  -  V2  __  _ 8 


where  M~x  (-y2 - 2 - — )-  _ - _ 

-» -a  #-a  + 


l+a 


Assume  G  =  (— W'1),.,,.,,, 


7  +  V7V-I 


»-l 


and  consider  the  following  best  approximation 


problem: 


.  min 


II  <?  -  <?  |U 


It  is  not  difficult  to  solve  this  problem.  The  minimum  norm  is 


.  min 

Q  €  BH, 


G  -  Q 


and  the  optimal  solution  is 


Q.  = 


1  +  0 


2  7  +  V  72a2-l 


(4-22) 


l  +  a 


i(- 

2  7  +  V^T 


(4-23) 


Of  cours«,  in  order  to  have  Eq.  4-22  make  sense,  the  Hankel  norm  of  G  must  be  less  than  or  equal 


to  1.  Therefore,  it  is  reasonable  to  assume  -L - -■  1  and  solve  for  7.  Once  7  is  found,  sub- 

-  7  +  v  7‘a  *-l 

stitution  of  it  into  Eqs.  4-22  and  4.23  leads  to  the  optimal  solution  and  minimum  achievable  norm  of  Eq. 
4-21. 

* 

Let 

-L  | _ L+JL—)  =m  1 

2  7  +  'J'fa.  2-l 

1  +  a  —  2(7  +  V  7^a  z— 1 ) 

(l+a)-27  =*  iVTtt 

Taking  square  on  both  sides  and  collecting  the  terms  with  same  power  in  7  yields 

4(1  -  a  2)72  -  4(l+a  )7  +  [4+(l  +  a  )*]  =  0  (4-24) 


There  are  three  cases  to  consider. 

li  9  —  1 

In  this  ease,  Eq.  4-24  is  simply  a  linear  equation  and  has  solution  7=1.  Therefore, 

7,  =  l 


If  a^l,  Eq.  (4)is  a  quadratic  equation  and  the  conditions  such  that  a  positive  real  solution  exists 

are: 


li) 


4(l  +  a  ) 


>  0 


a  >  1 


Therefore, 


(7i7Arl)*t“" 


7a  -  vVas-l 
7  +  V7  a  2-l 
7 1  +  V,72a2-1 


<3.  =  {(<?.)  +  }  •  A/'1  ^ 

78  -  \/ 72a 2-l 

=  1+a  >  7*  *  vVa2-l  7  *  vVa2-! 

2  7  -r-  a  7*  a  »  +  a 


(4-25) 


Substituting  the  optimal  7,  into  Eq.  4-25  separately, 


(0  »  <  1 

(»'»  )  8*1 

(in  )  a  >  1 


(1  +  a  )e  +2a 
2(a+a) 


Q.  -  -I- 


Q, 


7.  *  +  7. 


+ 


a  -1 

o 


a  +a 


where  7, 


2(8-1) 


-1  + 


V- 


a’  +  A?-n 

a  +1 


(4-26) 

(4-27) 

(4-28) 


Remark* 

The  significance  of  this  example  can  be  stated  as  follows: 

(i)  From  Eq.  4-22,  the  Hankel  norm  of  G  is  a  convex  function  of  7.  This  can  be  verified  by  comput¬ 
ing  its  second  derivative  with  respect  to  7  and  showing  that  it  is  always  greater  than  zero. 

(ii)  If  0  <  a  <  1,  then  7,  =  1 1  2 1  loo  -  This  means  that  the  lower  bound  in  Theorem  4.1.2  is  tight. 

(iii)  If  0  <  8  <1,  then  the  optimal  solution  is  not  unique.  Recently,  Professor  Bruce  Francis  showed 


a  family  of  optimal  solutions  which  includes  the  one  given  in  Eq.  4-26  [F6|.  However,  if  a  >1,  it 
can  be  shown  that  the  optimal  solution  (Eqs  4-27  and  4-28)  is  unique 
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(ii)  |4(l+a  J2]2- 4|4(l-a2))  [4  +  (1+a  )*)  >0  a*  +  a2  +  4a  -  4  >  0 


Cm  2t  a  >  1 


It  is  easy  to  see  that  (i)  and  (ii)  are  satisfied  automatically  in  this  case.  Hence,  in  (4)  will  have  a  real 
positive  solution  if  a  >  1  and  the  solution  is 


1 

2(a-l) 


seu 

a  +i 


Cue  3t  a  <  1 

In  this  case,  Eq.  (4)  has  no  positive  real  solution.  But  observe  that  if  0  <  a  <  1 


1  ,  1+a 

2  1  y/TTTtt 


)  <  1  V  7  >  - 


where  — 
a 


»  -a  "oo 


.  Therefore,  the  smallest  possible  value  of  7  is  simply  — 


To  summarize. 


(') 

(») 

(iii) 


a  <  1 

a  I 

a  >  1 


1. 

1. 


2(o-l) 


-1  + 


/  ;  ,  4(a~T 

V8  s+1 


»_Becoytry  of  the  optimal  solution! 


Note  that 


CHAPTER  5 

OPTIMAL  STATE-SPACE  SOLUTIONS  OF 
GENERAL  DISTANCE  PROBLEMS 


Id  this  chapter,  the  general  distance  problem  is  treated  in  the  context  of  the  real-rational  transfer 
function  matrix  (/?(*)  6  #1*,)  where  of  the  half-plane  instead  of  the  unit  disc  is  used.  Both  two- 
block  and  four-block  problems  are  considered  separately.  Since  any  real-rational  transfer  function  matrix 
has  a  finite-dimensional  state-space  representation,  it  is  shown  that  the  corresponding  GDP  can  be  solved 
using  attractive  state-space  methods  involving  only  standard  real  matrix  operations. 

Recall  that  the  GDP  can  be  simplified  to  an  equivalent  best  approximation  problem  if  the  value  of 
7  is  chosen  properly  (Theorems  3.4.1  and  3.4.4)  where  the  spectral  factor  of  the  para-Hermitian  matrix 
with  the  form  (7 ‘I  -  G*G  )  (or  (7 s/  -  GG*)j  is  used.  Since  the  stability  of  G  does  not  affect  the  result 
of  the  spectral  factorization,  i.e.,  the  poles  of  G  in  the  open  rhp  will  be  replaced  by  their  counterpart  in 
the  open  thp  ,  it  would  be  convenient  to  assume  G  €  ft//®.  If  this  is  not  the  case,  a  coprime  factoriza¬ 
tion  of  G  with  inner  denominator  can  always  be  found  and  the  numerator  matrix  can  be  used  in  place  of 
G .  This  particular  coprime  factorization  is  presented  in  Section  5.1  which  involves  finding  the  stabilizing 
solution  of  an  ARE.  Since  the  constant  term  in  this  ARE  is  identically  zero,  the  stabilizing  solution  can 
be  obtained  more  efficiently  by  a  "modified'  Schur  method  which  is  no  more  than  finding  the  Schur 
decomposition  and  solving  a  set  of  linear  equations.  This  is  discussed  iv  Section  5.2.  In  Section  5.3,  the 
state-space  formula  of  spectral  factors  (7 2l  -  G*G)yi  and  (t2/  -  GG*)yi  is  derived.  Once  the  GDP  is 
simplified  to  the  corresponding  best  approximation  problem,  Glover’s  algorithm  (G2)  can  be  applied  to 
find  the  solution  which  is  reviewed  in  Section  5.4.  Finally,  combining  the  state-space  formula  of  factoriza¬ 
tions  and  the  best  approximation  mentioned  in  Sections  5.1  through  5.4,  the  optimal  solution  for  the 
GDP  is  obtained  in  terms  of  the  "closed-form"  state-space  realization. 
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5.1  Coprime  Factorizations  with  Inner  Denominator 


In  this  section,  the  coprime  factorization  with  inner  denominator  will  be  developed.  Explicit  state* 
space  realizations  will  be  given  which  involves  solving  an  algebraic  Riccati  equation.  Without  loss  of  gen¬ 


erality,  it  is  assumed  that  G  —  L,  £  Rfxm  and  the  realization  is  minimal. 


Theorem  1 

Consider  G  £  Rfxm.  Then  there  exists  a  ref  G  =■  NM  "l  such  that  A/  £  RH*xm  is  inner  if 
and  only  if  G  has  no  poles  on  the  j'l^-axis.  A  particular  realization  is 

'v/1  \a+bf\b 

‘v  =  FI  £  RH<0m+’)xm  (5-1) 

C+DF  D 


F  -  -  BtX 


A  ~BBT 

X  -  Rle  0  -X'  2  0 


Proof 


(Sufficiency): 


Assume  G  has  no  poles  on  the  j  u>-axis,  therefore,  Re[X,(A)|  ^  0.  It  is  known  that  the 
ref  G  =*  iVW  "*  can  be  obtained  using  state-feedback  gain  F  such  that  (.4  +BF )  is  a  stability  matrix 
where  Af  and  N  have  the  realization  as  in  Eq.  6-1.  Since  A/  is  required  to  be  inner,  the  following  two 
equations  must  be  satisfied  for  some  X  (Lemma  2.3.3): 


BtX  +  F  -  o 


Note  that 


(Eq.  5-4) 


(/I  +BF)~ X  +  X(A  +BF)  +  FtF  -  0 


F  «  -  B~ X 


(Eq.  5-5)  A  T  X  -  XBB  7  X  +  XA  -  XBB 7 X  +  XBB  7  X  «*  0 

«*>  A7X  +  XA  -  XBB7 X  -  0.  (5-6) 

Eq.  5-6  is  an  algebraic  Riccati  equation.  Since  (A  ,B)  is  a  controiiable  pair  and  RejX,  (A  )]  ^  0,  Eq.  5-6 
has  a  unique  stabilizing  solution  X  >  0. 


(Necessity): 


GG *  =■  NM  ~l[NM  '*)•  »  MW  -1( A/  "l)*M*  =»_MN*  . 


Since  N  6  RHV,  GG*  has  no  poles  on  the  j i>axis.  Therefore,  G  has  no  poles  on  the  j w-axis. 


Similar  results  for  the  icf  can  be  established  easily  by  duality. 


Corollary  2 


There  exists  a  fc/  G  **  M  ~lN  such  that  A/  g  /?//£,*’  is  inner  if  and  only  if  G  has  no  poles  on 


the  ;u/-axis.  A  particular  realization  is 


["  "] 


A  +HC  H  B  +HD 
C  l  D 


e  RH’0*if+m) 


A7  -CTC 

>'  -  R'c  0  -a  5  °  • 


.»  •  -  *  *  •  •  •  4  •  "  ►  ’  e  ■  *  ’  *  *  «  •  »*•  *'*•«.  ’  •  •  •  . '  ■  t''  . 


133 


Remark* 

(i)  The  minimality  condition  on  G  is  not  necessary  and  can  be  weaken  to  that  the  pair  (A  ,B ) 
((C,A  ))  is  stabilizable  (detectable)  and  Re[X , (>4  )]  ^  0  in  Theorem  1  (Corollary  2). 

(ii)  Note  that  the  constant  term  in  the  ARE  (Eq.  5-6)  is  identically  zero.  Although  Eq.  5-6  can  be 
solved  by  the  existing  algorithms,  for  example,  the  Schur  method  in  [Ll|,  a  simpler  and  more 
efficient  algorithm  will  be  presented  in  the  next  section  for  this  class  of  ARE. 


-  >  / 

aJl » 


6.2  A  Lyapunov  Approach  for  Obtaining  the  Inner  Denominator 


In  Section  5.1,  the  coprime  factorization  with  inner  denominator  is  obtained  where  an  ARE  is 
required  to  be  solved.  In  this  section,  a  more  efficient  algorithm  will  be  developed  by  taking  advantage  of 
the  block  upper  triangular  structure  of  the  associated  Hamiltonian  matrix  (Eq.  5-3  or  5-9).  The  computa¬ 
tion  involved  is  much  simpler  compared  to  that  in  the  general  Schur  method  where  the  Schur  reduction  of 
a  2n  X2n  matrix  is  required  {Llj.  The  idea  of  invariant  subspace  corresponding  to  the  eigenvalues  in  the 
open  Ihp  will  be  used.  Without  loss  of  generality,  only  the  right  coprime  case  is  treated  here. 


Consider  the  Hamiltonian  matrix  which  arises  in  the  ref  with  inner  denominator. 


A  -BBT 
0  -A  T 


where  Re[X,  (A  )|  jL  0,  »  =  1,2 . n  and  the  pair  (A  ,B)  is  stabilizable.  The  algorithm  is  composed  of 

the  following  steps: 


9isfi-L 


Find  the  upper  Real  Schur  Form  of  AH  such  that  the  eigenvalues  with  negative  real  parts  appears 
first,  i.e., 


UT  AU 


A,  A„ 

0  A„ 


where  A,  and  -.4,  are  stable  matrices  and  U  =  ^C/,  Uu  j  is  an  orthogonal  matrix.  Note  that 
the  columns  of  U,  are  the  Schur  vectors  corresponding  to  the  eigenvalues  with  negative  real  parts. 


SisaJ. 


Apply  the  similarity  transformation  T  t 


to  Ah  : 
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UtAU  -UtBBt u 
0  -UtAtU 


A,  A,u  -Bt  B,  -B,  Bj 
0  A,  -Bu  B,t  -Bu  Bj 
0  0  - A /  0 

0  0  -A*  -Aj 


where  Bs  =*  U/B  and  Bu  =*  UjB  . 


Solve  the  Lyapunov  equation: 

Au  Z  +  ZAj  =  Bu  B„r  .  (5-10) 

Since  the  eigenvalues  of  Au  are  in  the  open  rhp  ,  Eq.  5-10  has  a  unique  solution  Z.  From  the 
assumption  of  the  stabilizability  of  (/I  ,B ),  the  pair  (Au  ,BU  )  must  be  controllable.  Therefore,  Z  >  0 
is  concluded  from  the  Lyapunov  stability  theorem. 


Step  1 


Define 


T2 


10  0  1 

0  /  0  ~z 

0  0/0 

0  0  0  / 

>  < 


Apply  similarity  transformation  Fj  to  TxAhT\~x  to  get 


4, 

-A^-B,  B,t  -Aj-As  +A„Z-B,Bj 

0 

-B„  B,r  +  Z A, l  0 

TATxA„Tx-')T2-'  = 

0 

0 

-a,t  0 

0 

> 

0 

j  r  _  A  T 

Apt 

Step  5 


Form  the  third  similarity  transformation  T  ,  as 


/  0  0  0 
0  0  0  / 
0/00 
0  0/0 


A,  -Aj  -A,  +  A„  Z  -  B,  Bj  An  -Asl~Bs  B,T 


r3(r2r1Aw7Yl7y1  )rs- 


-b„b,t  +  zaZ 

o  - A,r 


Step  0 


Define  T  »■  T s7 jT lt  then 


y-i  _  r^TY^s"1 


/  0  0  /  /  0  0  0 

f/olo/oz  00/0 

Of/  00/0  000/ 

0  0  0  /  0  /  0  0 


Note  that  the  left  upper  nX»  matrix  in  Eq.  5-11  is  stable,  therefore,  the  stabilizing  solution  A'  is 
simply 


U  0  I  i  i  r  0  z 


o  o  [tv 


o  z  1 1  r. 


»  ^  .  l  •  »  *  .X  \  *  ■«  %  *,  V  *.  *,  "  V  V  *  »  •  •  «  »  ’  « 


S.3  Spectral  Factorisation* 


In  this  section,  without  loss  of  generality,  G  (r  )  is  assumed  to  be  stable  since  any  G  €  RL^  may  be 
factored  using  Corollary  5.1.2  to  obtain  a  stable  numerator  N  such  that  N*N  »  G*G .  The  objective 
now  is  to  derive  a  state-space  formula  for  the  spectral  factor  of  (72/-G*G  )  (or  (t2/ -GG •)). 


Theorem  1  (Spectral  Fs^torlsation) 

Assume  G  (#  )  6  #//£,*"  and  7  >  ( | G  (»  )  j (go.  Then,  there  exists  a  M  6  RH**m  with  stable 
inverse  such  that  A/*Af  *  72/  -  G*G  with 


where 


Rd  «  72/  -DtD  >  0 
Kc  -  -RD-\BT  X  -DT  C) 

and 

A+BRd-'DtC  -brd'bt 
X  -  Ric  cT(l+DRplDr)C  -(A+BRd-'Dt  C)T 

Proof 

Let 


r  »  72/  -  G*G 


[fl  r  (-«/  -A  r  )'1 


-CT  C  -CT 
-DTC  Rd 


D 


Since  7  >  |(G  ((*,  r(j  w)>0.  The  minimality  of  the  realization  of  G  guarantees  that  (A  ,  B )  is 
controllable  and  (-Gr  C  ,  A  )  is  observable.  Thus,  from  Corollary  2.3.2,  there  exists  A/  €  RHZ  'm  and 
A/'1  g  /?//*  such  that  f  =  A/*  A/  and  a  particular  realization  is 


Kc  -  -  RD~\BT  X  -  DT  C) 


and 


-Y 


Ric 


A+BRolDTC  -BRd-1Bt 

Ct(I+DRd-'Dt)C  -(A+BRd-1Dt  C)t 


Since  G  is  stable,  it  is  concluded  that  M  g  /?//£**  . 


QED 


The  following  corollary  is  the  dual  result  of  Theorem  1. 


Corollary  2 

With  the  same  assumptions  as  in  Theorem  1,  then  there  exists  a  M  g  /?//£,* p  with  stable  inverse 
such  that  M  M  •  »  y2/  -  GG*  with 


M 


A 

C 

p  to 

R  D 

(5-13) 


where 


Ro 

Kf 


l2I  -DDT  >  0 
-  ( YC  T  -  B  T  D  )R  5l 


and 


Y  -  Rlc 


(a+bdt  r  s'cy  -ctr  b'c 
B  (/ +D  T  R  pD  )B  r  -(A+BDT  R  B'C] 


Remark 


The  notation  (7*/  -  G*G  (  (7s/  -  CG*)W  )  will  be  used  to  denote  the  spectral  factor  M  (A?)  in 


Theorem  1  (Corollary  2). 
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Theorem  3 

If  7  =  ||G(»)||oo  but  7  >  5[C(oo)j  in  Theorem  1,  then  there  exists  a  M  6  RH^xm  with  A/'1 
analytic  in  the  open  rhp  such  that  M*M  =  -j2l  -  G*G  with 


.4 

B 

_  -  Rdl‘K<  1 

!  d 

1 

where 

rd  -  r/  -  dt  d  >o 
A,  =  -  Rd'(Bt  X  -  DT  C) 

and  A'  is  the  unique  solution  to  the  ARE 

(A  +BRd'Dt  C)T  X  +  X(A+BRd-1Dt  C)-XBRd-'BtX  +  CT(I  +DRd-'Dt)C  «  0(5-14) 
such  that  RejX,  (.4  +BKC )]  <0,  Vi. 

Proof 

Let 

r  =.  72/  -  G*G  -  [flr(-»/-A  r)-‘  /] 

By  assumption,  r(;w)  >  0  and  RD  =  T(oo)  >  0.  Since  (A  ,B)  is  controllable,  from  a  theorem  of  J.C. 
Willems  [Wl|,  there  exists  a  unique  solution  X  to  Eq.  5-14  such  that 

A  +  BRd-1DtC  -  BRd-'BtX  =  A  +  BKC 

has  no  eigenvalues  in  the  open  rhp  . 

QED 


-CTC  -CT D 

(si -A  )-‘J5 

-DtC  Rd 

/ 

Corollary  2  can  be  generalized  in  a  similar  way  to  the  case  where  7  =>  j|C  He  but  7  >  <r|C  (oc )) 


such  that  M  1  is  analytic  in  the  open  rhp  . 


5.4  Balanced  Realisation!  and  Best  Approximation  Problems 


Id  Section  3.2,  it  was  shown  that  there  exists  an  optimal  solution  Q  6  Hx  to  Eq.  3-14  such  that 
1 1/?  -  Q  1 1®  =  ||H/f  ||.  The  same  equation  will  be  reconsidered  in  this  section  for  real-rational  R  .  The 
idea  of  balanced  realization  will  play  a  key  role  in  this  problem. 


Recall  that  a  stable  transfer  matrix  G  (t  )  with  minimal  realization 
there  exists  a  diagonal  matrix 

E  =*  diag  (<7j,  o2 . .  <r, )  /Tj  ><t2  >  .  >  <7 

such  that  the  following  two  Lyapunov  equations  are  satisfied: 

A  E  +  EAr  =  -  BBt 


A_ 

C 


n 


_5 

D 


> 


0, 


is  called  "balanced"  ft 


and 

A  r  E  +  E/t  =  -CTC. 
where  {<7,  }  are  called  the  "2nd  order  modes"  of  the  system  |M4,P2]. 


If  G  (»)  is  completely  unstable  (i.e.,  no  eigenvalues  in  the  closed  Ihp  ),  then  G  (s  )  is  said  to  be  bal¬ 
anced  if  G  (-»  )  is.  The  method  for  obtaining  balanced  realization  is  not  unique  which  can  be  found  in 
[El,L2,M4|.  Note  that  balanced  realization  is  also  very  useful  in  model-reduction  |El,G2,K7j. 


The  following  lemma  shows  the  relation  between  the  norm  of  a  Hankel  operator  and  the  balanced 
realization  of  a  completely  unstable  transfer  function  matrix  G . 

Lemma  1 

Assume  G  is  real-rational  and  completely  unstable  in  Eq.  3-14.  Then 

II  Hc  ||  =  <7, 

where  <7,  is  the  largest  "2nd  order  mode’  of  G  . 
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This  lemma  is  very  useful,  since  it  relates  that  the  Hankel-norm  can  be  computed  easily  using  state* 
space  methods.  In  fact,  even  if  the  realization  is  not  balanced,  the  norm  is  equal  to  Xj^JVY)  where  X 
and  Y  are  the  controllability  and  observability  gramians  respectively  and  can  be  computed  by  solving  the 
corresponding  Lyapunov  equations: 

AX  +  XA  T  =*  BB T 

and 

« 

AtY  +  YA  =■  CTC. 

Now,  consider  the  best  approximation  for  the  real-rational  case: 

7*  “  <}“!&*,  11  *  ~  Q  ll<x  ’  *(-•)€/?//..  (5-15) 

The  question  is:  does  Eq.  5-15  have  a  solution  which  is  also  real-rational?  The  answer  is  yes  and  the  proof 
can  be  found  in  [A1,A2,B3,D16,G2|. 

Theorem  S 

If  R  is  real-rational  in  Eq.  5-15,  then  there  exists  a  best  approximation  Q  6  RHX. 


Theorem  3 


Assume  that  Q,ft  is  the  best  approximation  in  Eq.  5-15,  then 


(i)  if  p  =  1  or  m  =  1,  Q,pl  is  unique  and  — (G  -Qip t)  is  all-pass. 

(ii)  if  p  l  and  m  ^  1,  then  Q,pt  is  not  unique  and 


7, 


c -  1  V  u/€  F 


There  are  various  algorithms  available  to  solve  this  problem  [Bl,B3,Dl6,G2j.  However,  the  Glover  s 


algorithm  (G 2]  appears  to  be  the  most  efficient  one  so  far  and  will  be  reviewed  next. 
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Without  loos  of  generality,  R  is  assumed  to  be  strictly  proper,  completely  unstable  (the  stable  part 
of  R  can  always  be  absorbed  into  Q  )  and  square  (if  not  square,  add  rows  or  columns  of  zeros).  Let 


R  =  ^(Tjo'J  **  a  minimal  realization.  Then  the  optimal  Q  €  /?//«,  can  be  constructed  using  the  fol¬ 
lowing  steps: 


Step  1 


Find  a  balanced  realization  of  R 


Thus  both  the  controllability  and  observability  gramians  are  diagonal  and  equal  to 


a  l,  0 
0  E 


where  a  >  ||E||,  i.e.,  r  *  multiplicity  of  <T.  Partition  A  ,  B ,  and  C  accordingly  as 


-4U  B  l  r  -1 

A  =  [a2,  An  '  B  =  |fl2]  ’  and  C  “  l^1  C*\ 


Step  2 


Choose  D  such  that 


D  Bj  +  <7CX  = 
D  D  r  -  a2!. 


Step  3 


B  -  +aC?D) 

A  -  Ma  +  B2fir)r 


-A  * 


6.S  Stnte-Spnce  Solutions  of  General  Distance  Problems 


Combining  the  results  from  previous  sections  of  this  chapter,  in  this  section,  the  state-space  realiza¬ 
tion  of  the  optimal  solution  of  the  general  distance  problem  will  be  derived  step-by-step.  Although  the 
formula  may  not  look  very  simple  in  its  appearance,  the  implementation  is  quite  straightforward.  Once 
again,  the  2-block  problem  will  be  considered  first,  and  the  results  are  then  generalized  to  the  4-block 
problem. 


Without  Io6s  of  generality,  both  Rx  and  /?2  are  assumed  to  be  completely  unstable  with  the  realiza- 


^1  ~  Ci  Di  ^2  “  c  j) 


where  the  realization  R2  is  minimal.  The  solution  can  be  found  from  the  following  steps: 


Step  j  (  Left  coprime  factorization  with  inner  denominator  )  [Corollary  5.1.2J 


R2  =,  A/  "‘jV  with  N  = 


A  B 

CD  ^  RH~ 


where  A  =  A  +  HC  ,  B  =*  B  +  HD  and  H  is  the  observer  gain  such  that  A  is  asymptotically 
stable. 


Step  2  (Spectral  factorization  of  {i2!  -R*R  2()  [Theorem  5.3. l| 


Find  the  spectral  factor  M  such  that  A i*\i  =■  ~\2l-S*N  =»  -j2l  -  /?,*/?- 
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where 


M 


A  B 

-  Rd*K<  \Rd* 


Rd  -  7 2I-DtD 

Kc  =-  -  Rd-\Bt  X-Dt  C) 


X 


Rie 


'  A  +  B  RolD  T  C  -B  RD-lB  t  1 

C  T(I  +DRplD  T)C  -(A  +  BRd1DtC)t  | 


where  Ric  (Ah)  denotes  the  stabilizing  solution  of  the  Riceati  equation  with  the  associated  Hamil¬ 
tonian  matrix  Ah  ■ 


Step  3 

Form 

G  -  RtM~l 


■^1 

fl,' 

'a  +bkc\ 

brd 

= 

01. 

* 

Kc 

Rd’* 

A  i  B\KC 

BiRd-"' 

= 

0  A  +BKC 

brd-w 

Cy  DXKC 

b 

DiRd'1* 

Step  4  (  Spectral  Decomposition  ) 


(i)  Solve  the  following  Sylvester  equation  for  Z : 

-AXZ  +  Z(A  +BKC)  -  -BXKC  . 

Since  both  -  (-4  i)  and  (A  +  B  Kc )  are  stable,  Eq.  5-22  has  a  unique  solution 


(ii)  Conjugating  the  states  in  Eq.  5-21  by  T 


l  Z 
0  / 


I 


(M2) 


>1,  0  (B ,  +  ZB  )Rd~  1/4 

0  A  +BKC  BRD'in 

C,  DxKe  -  c\z\  D  lR[>~ 1/2 


^  1 1  (Si  + 

Cj  0  + 


Gy  (7, 


‘  A  +  BKC  BRD- lfl 
D iKe  -  CiZ\DiRD-xn 


where  (7,  is  stable  and  G „  is  completely  unstable. 

Step  6  (  Computation  of  the  Hankel-norm  ) 

Compute  the  Hankel-norm  of  Gv  (»  ),  called  /  {if),  which  is  equal  to  [A^IV,  W,  )|Vj  where  Wc  and 
W,  are  the  controllability  and  observability  gramians  of  Gu  (-» )  respectively,  i.e.,  W c  and  W,  are 
the  solutions  of  the  following  two  Lyapunov  equations 

AtWc  +  WcAf  -  (B ,  +  ZB  )Rol(B ,  +  ZB  )T 


A[W,  +  W,  A ,  -  C,rC, 


respectively. 


(a)  if  /  (i)  >  1,  it  means  that  the  value  of  i  is  too  smalt.  A  larger  i  should  be  chosen,  and  the  pro 
cess  will  restart  from  9tep  X  . 

(b)  if  /  (i)  *  l,  from  Theorem  4.2.8,  the  corresponding  i  is  the  minimal  achievable  norm  in  (2gdp). 


(c)  'f  /  (l)  <  1,  From  Theorems  4.2.5  and  4.2.9  two  cases  can  occur: 


(')  7,  *  r 


m.~C  »  -  ■'  _  ~~  ‘ 


(ii)  7.  <  7- 


Case  (i)  can  happen  only  when  7,  =  1 1/? 2|  loo-  It  can  always  be  detected  by  letting  7  =  1 2 1  |oo  +  «, 
in  the  beginning  of  the  iterative  process  where  c,  is  a  very  small  but  positive  real  number,  for  example, 
MT*.  If  the  corresponding  /  (7)  <  1,  then  7,  must  be  equal  to  1 1-/? 2! |oo-  This  special  case  will  be  dis¬ 
cussed  later  in  this  section.  Assume  that  (ii)  is  the  case,  it  means  that  the  value  of  7  is  over-estimated. 
The  value  of  7  should  be  decreased.  Then  the  process  will  restart  from  Step  2  . 


Step  0  (  Best  Approximation  )  [Section  5.4] 


If  /  (7)  **  1,  Glover's  algorithm  which  was  reviewed  in  Section  4  can  be  used  to  find  the  best  approx¬ 
imation  Q  ,  €  RHn o  of  (7U  .  Assume  Q  ,  has  the  following  realization: 


Aq  Bq 
Cq  Dq 


Sisal 


(i)  Form 

A  +  BKC  I  B  RD~  w 


)XKC  -  ClZ\DlR 
A  +BK,  -B 


D  XKC  -CXZ  -  D  xKe\  D  x 

(ii)  Conjugating  the  states  in  Eq.  5-23  by  T 

A  +  BK.  0  I  0 


Ro*K c  I  V2 


-  CXZ\DX 


CQ  -  Dq  Rd^K,  I  Dq  Ron 


Note  that  Eq.  5-24  can  be  written  as 


Therefore, 


U<,  -  bq  r^kc  I  bq  rA 


_ A _ B_ 

~cxz  dT 


Q.  -  {Q.  +  G,)M 


Aq  ~  Bq  R^Kc  BqRd w 

_0 _ A _ B 

CQ  -Dq  Rp^K'  -  C.ZIDqRd^2  +  Dx 


If  7.  “  I  \R  2 1  loo  and  7,  >  ^(/?2(°c)|,  then  Step  1-8  can  still  be  used  to  find  the  optimal  Q, ,  the 
only  difference  is  in  Step  8  where  the  factor  M  will  have  zeros  on  the  /w-axis,  i.e.,  A  +  B  Ke  has  no 
eigenvalues  in  the  open  rhp  .  Since  (A  +  B  K e )  does  not  appear  in  Eq.  5-25,  the  corresponding  Q,  is 
still  in  RHac-  If  7,  ■■  li^zlloo  but  7,  »  7(/?j(oo)|,  then  A/  does  not  have  full  rank  at  00,  which  is 
equivalent  to  saying  that  M  has  no  inverse  in  RLX.  Therefore,  the  7-iteration  doesn’t  apply  to  this  par¬ 
ticular  problem.  Although  R  j(oo)  can  always  be  perturbed  by  some  e>0  and  use  the  method  described 
here  (Step  1-8),  there  is  no  theoretical  support  for  this  argument.  Further  research  needs  to  be  done  in 
this  special  case.  Theorem  3.2.2  is  therefore  proved  except  the  last  case. 


*  /  .*■  ,*  .«  «*  .*■ 


151 

The  4-block  case  is  much  more  complicated  than  2-block  case.  However,  a  closed  form  solution  can 
still  be  found.  Without  loss  of  generality,  R  is  assumed  to  be  completely  unstable  (Theorem  2.3.6).  Due 
to  the  complexity  of  the  problem,  the  realization  in  the  first  two  steps  will  not  be  shown  which  are  not 
critical  in  the  process. 

The  solution  can  be  found  from  the  following  steps: 

Step  1 

(i)  Find  factors  S  =  (y*!  -R&R  22)w  and  S  =»  (y1!  -  R^R^)1*. 

Note  that  5  and  S  need  not  be  spectral  factors.  See  the  Remark  (ii)  after  Theorem  4.1.3. 

(ii)  Form  L  —  and  L  =  S'xRiv 

(i)  Find  the  ref  with  inner  denominator  for  L  , 

L  =  Nl  Mfl  . 

(ii)  Find  the  lef  with  inner  denominator  for  L  , 

L  -  Mr'Nc  . 

Assume  that  /Vt  and  N[  have  the  following  minimal  realizations: 


Step  3  (  Spectral  factorization  ) 

(i)  Using  Theorem  4.1.1,  find  the  spectral  factor  M  such  that 

MM*  -  I-Nl  Nl*  -  /  -LL*  . 
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where 


A. 

-  K,  R  F 

* 

R%* 

(ii)  Find  the  spectral  factor  M  such  that 

M*M  —  I-N?Nc  =*  I~L*L 

where 


M 


A 2  B  2 

-RonKe  Rp* 


For  simplicity,  the  expressions  for  Kc,  Kr  ,  RD  and  R  D  will  not  be  written  down  explicitly. 


Step.  .4 


(i)  Form  the  linear  fractional  transformation 


-  -(/?„  +  R^'l  -R&RvY'R&Rn ) 
7  7  7  l  J 


Assume  that 


F,(-R.-Ri)  -  G,  +  G, 

7  7 


A,  B, 
C,  D, 


A  y  By, 

Cu  0 


where  G,  is  stable  and  Gv  is  completely  unstable. 


Si£&-£ 


Form  the  product 
R  -  M~lGv  M~l 


A  j  +  K  f  C  j 

l_jo_  1 

s,' 

A  2  +  B  2 K  c 

B2Rd~  w 

mm  | 

.  Rd^C, 

\t7*\ 

* 

c. 

0 

* 

L  K' 

Rd-1* 

Step  6  (Spectral  decomposition) 


R  *  Rt  +  Iiu 

with 


Ax  +  K,  <7, 

0 

-ZxByKe 

A  2  +  5  j/Cc 

-Z.ByRo'* 

B2Rd-1'1 

Ro^C, 

RD-v\clzl+Cy)Z2 

0 

|(BU  -Z2B2)/?d-» 

+  Cy  )!  0 

where 

(i)  Zi  is  the  unique  solution  of  the  following  Sylvester  equation: 

(A,  +K,  CJZt  -  Z,A,  +  K,  C„  =  0  . 

(ii)  Z2  is  the  unique  solution  of  the  following  Sylvester  equation: 

Av  Z  2  ~  Z  J(A  2  +  B  j/ff )  +  Bu  Kc  *  0  , 


Stop  7  (  Best  approximation  ) 

Find  the  best  approximation  of  Ry ,  i.e., 

f  (7)  "  q  11  ~  Q  llo°  '  (S'26) 

Using  Glover’s  algorithm,  /  (7)  can  be  computed  before  the  best  approximation  of  <7„  is  found.  If 
/  (7)  >  1,  it  means  that  the  value  of  7  is  too  small.  A  larger  7  should  be  chosen,  and  the  process 
will  restart  from  Step  1  . 

If  /  (7)  is  much  smaller  than  1,  in  general,  this  implies  that  the  value  of  7  is  over-estimated  and  a 
smaller  value  for  7  should  be  used.  It  is  desired  that  7,  is  equal  or  as  close  as  possible  to  1.  How¬ 
ever,  this  is  not  true  for  every  case. 


If  /  (7)  >•  1,  then  the  optimal  solution  to  Eq.  5-23  is  Q  ,  with  the  following  realization. 


■*Q 

M 

[c<3 

Do 

Therefore,  Q,  *»  Q  ,  +  R,  is  the  best  approximation  of  R  .  Correspondingly, 

Q,  -  AT1)^  -  MQ.M  +  G,  . 

Step  8 

(i)  Form 

Q.  -  Q,  +  R, 


-40 

Bq' 

A  1  +  K  j  C 1 

-  Z  xBt  Kc 

-Z^Ro'^ 

=- 

C0 

Do 

0 

A  j  + 

B2RD-ln 

0 

nD-l*c  x 

R  oVt{ClZl^C<.)Z2 

0 

AX+K,CX  0  -ZXBUKC 

- ZxBuRo *,/4 

0  Aq  0 

Bo 

0  0  A  2  +  B  2^ 

B2RD-ln 

Rd*Cx  Cq  Rp-^CxZx+CJZ2 

O 

(ii)  Form 
MQ.M 


\AX  +  K,CX  0  -ZXB,K. 

-  Z  XB,  R0~ 1/3 

K,  R  on 

O 

O 

X 

O 

Bq 

>1, 

‘ 

R}n 

* 

0  0  A ,  *  0,/C 

BZR0~ w 

* 

.  -  /?ow/C 

0^ 

RfV2Cl  CQ  R  o^CJt+C^Zt 

■4 1 

-K,  C, 

/C,  R  'onCq 

-K,  (CxZ^C,)Zt 

/C/  R  p*Dq  Ro^K, 

-  K,  Rbv,Dq  R0Vl 

0 

AX+K,CX 

0 

-Z,B,K. 

Z  tB,  K, 

-  Z  iB, 

0 

0 

Mg 

0 

-Bq  RtpK, 

Bq  Ro  n 

0 

0 

0 

^2  +  5  2^( 

-b2k, 

0, 

0 

0 

0 

0 

A  , 

b2 

C, 

C, 

R  PCq 

(C XZ i+C,  )J, 

R  l*Dq  Rq^K, 

R  o~Dq  Rd y* 

r. 


V  \  ^ ^  j* '*  -"v  A  ^  »'* 

^ -•  •-<  o.  ».-•  «-vn.w  «- .  -w-iw  v._-w  ■w-  jw-.  ^  w 
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6.1  Summary  of  H2  and  Hx  Optimal  Control 

This  section  is  a  recapitulation  of  the  results  developed  in  the  previous  chapters  to  give  a  complete 
algorithm  for  solving  H2  and  Hx  optimal  control  problems.  The  key  steps  are  summarized  as  follows: 

Step  li  Pamaietrisatlon 

Find  J  so  that  the  substitution  K  =F j  (J  ,Q  )  yields 

F,(P,K)  -  F,(P  ,F,(J  ,Q  )) 

-  F,(T.Q) 

=■  +  T12QT21 

with  the  additional  requirement  that  T  6  PH  x  and 

F(  (P  ,K  )  internally  stable 
if  and  only  if  Q  6  PHX. 

This  parametrizes  all  stabilizing  it  s  in  terms  of  a  stable  Q  6  if®  in  addition  to  providing  an  affine 
parametrization  of  all  (internally)  stable  F ,{P  ,K).  This  "Youla  parametrization*  |D5,Y2]  is 
developed  in  Section  2.2.  In  particular,  explicit  formula  for  J  and  7*,; 's  are  derived  using  the 
observer-based  stabilization  method  (Eqs.  2-33  and  2-34). 

Step  l*i  Coprime  Factorisation  with  Inner  Numerator 

A  further  requirement  is  that  and  T21  are  inner  and  co-inner  with  the  nonsingular  constant 
*  matrix  multiple  respectively.  Methods  for  obtaining  the  particular  parametrizations  which  achieve 
this  are  developed  in  Section  2.3.  By  appropriate  scaling,  the  new  affine  parametrization  of  the 
closed-loop  transfer  matrix  becomes 

^11  ~  ^  1  zQ  ^21 

where  Nl2  is  inner  and  i®  co-inner  with  realizations  shown  in  Eqs.  2-46  through  2-49. 


Step  2i  Unitary  Invariance 


Find  the  CIFs  N ^  and  N  ^  so  that  |N12 

(Eqs.  2-46  through  2-49)).  Then  pre-  and 
2-52) 


>1 


S ^  |  and 


N 


21 


lNl 


are  iquare  and  inner  (also  Section  2.3 


post-multiply  by  £.Yl2  jV^j*  and 


1  • 

N2i 

and 

* 1 . 

to  yield  (Eq. 


R  n~  Q  R 12 

u  ~  N l2Q  iV2j 

a 

R  21  R  22 

(a  =  2,  oc 

a 

where 


Rn 

R  12 

>i \ 

r  -  i 

R  - 

R  21 

R  22 

=■ 

.Ni. 

T ,» 

The  solutions  to  the  //2  and  H „  optimization  differ  completely  in  the  following  step. 


Step  3;  Projectlon/Dllatlon 

(A)  H2  optimization: 

For  a=~2,  the  unique  optimal  solution  for  <?  is  immediate  from  Eq.  2-54: 

Q  «»<  =*  P H,(^  ll) 


(B)  H jo  optimization: 

For  a»oo,  the  general  distance  problem  is  solved  using  q-iteration  (Chapters  3,  4  and  5).  The 
iterative  procedure  is  the  following: 

(i)  compute  the  lower  and  upper  bounds  (Section  4.1). 

(ii)  reduce  the  GDP  to  the  best  approximation  (Sections  4,1  and  5.3). 


(iii)  Sod  the  Hankel  norm  of  the  corresponding  approximation  problem  (Section  5.4). 

Iterate  on  (ii)  an  (iii)  using  the  properties  of  ^-iteration  (continuity,  monotonicity  and  convexity) 
until  the  optimal  7  (or  arbitrarily  closed  to)  is  found  (Section  4.2). 

(iv)  derive  the  state-space  formula  for  Q  ,p,  (Sections  5.4  and  5.5). 

The  complete  state-space  procedure  for  both  2-  and  4-block  GDPs  are  shown  in  Section  5.5 

Step  4i  Recovery  of  the  optimal  controller 

(i)  First,  recover  Q,pt  from  Q  ,pl  through  E<j.  2-51: 

Q.p,  -  -Ro'^Q  .ptRo^ 

(ii)  The  optimal  controller  Kcpt  can  be  recovered  easily  from  the  LFT 

K.P,  -  F  ,(J,Qtpl) 

where  J  is  shown  as  in  Eq.  2-33. 


The  above  steps  are  also  summarized  in  Figure  6-1. 


min  \  ||  Y{PJC)  ||a  |  internal  stability 

L 


Parametrizarion 
K  -  F/J.Q) 


min  II  7*n  +  TmQTu  ll« 

Qm  Hm 


♦ 

min  ||  r„  -  Nu£>Nn  ||a 

Q  m  Hm 


where 


"12X2  -/ 
^21^21*  *  ^ 


min 


*lH2  *12 
,  *21  *22. 


a-2 

Projection 


/ 


\ 


V 


Qapt 


/ 


a  *  •• 

GDP 


Kap,  =  F^/.Cop.) 


Figure  M. 


■yr^v 
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0.3  Numerical  Consideration* 

The  implementation  of  the  procedure  described  in  Section  6.1  is  quite  straightforward  since  at  each 
step,  the  state-space  formula  are  also  derived  in  parallel  to  the  theory.  Therefore,  only  standard  matrix 
operations  (addition,  subtraction,  multiplication,  inversion)  and  the  familiar  linear  algebra  techniques 
(singular  value  decomposition,  eigenvalue/eigenvector  decomposition,  Schur  decomposition,  etc.)  are 
required  essentially.  They  constitute  the  nucleus  of  the  entire  approach  and  there  are  very  reliable 
software  packages  available  [D6,Glj  for  this  purpose. 

Most  of  algorithms  developed  require  solving  some  particular  algebraic  equation  at  an  intermediate 
stage.  They  are  ARE,  Lyapunov,  and  Sylvester  equations.  Therefore,  robust  algorithms  for  solving  these 
equations  are  extremely  critical  to  a  successful  implementation.  Many  reliable  algorithms  can  be  found  in 
the  literature,  (for  example,  (B2,G3,Hl,Ll,Vlj). 

Note  that  the  minimality  (or  stabilizabiiity /detectability)  condition  is  an  important  assumption  in 
most  of  algorithms  (coprime  factorization,  spectral  factorization,  etc.).  To  guarantee  this  condition,  a 
simple  procedure  is  recommended  here  which  is  closely  related  to  the  notion  of  ’’balanced"  realization 
(Section  5.4). 


Recall  that,  for  a  given  stable  transfer  matrix  G 


A  B 
C  D 


if  the  realization  is  balanced  (Section 


5.4),  it  must  be  minimal.  Therefore,  minimality  can  be  achieved  by  "balancing".  Although  all  the 
balancing  algorithms  developed  so  far  require  the  given  realization  (before  balancing)  to  be  minimal 
[L2,M4j,  they  can  be  modiSed  to  handle  the  non-minimal  case  by  removing  the  singularity  of  controllabil¬ 
ity  and/or  observability  gramians.  The  (balanced)  minimal  realization  can  be  obtained  similarly  for  the 
completely  unstable  system  (see  Section  5.4).  If  the  given  realization  of  G  has  both  stable  and  unstable 
eigenvalues  (but  not  on  the  ;  u-axis),  then  G  can  be  expressed  as 


G  »  G,  +  (7U 


where  G ,  is  stable  and  Gu  is  completely  unstable. 
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This  can  be  accomplished  using  Schur  decomposition  with  eigenvalue  ordering  [Ll.Vlj  and  solving  a  set  of 
linear  equations  (see  Step  4  in  Section  5.5(A)).  Then,  combining  the  balanced  realizations  of  G,  and  Gu 
leads  to  a  minimal  realization  of  G .  Since  the  state  matrix  of  either  G,  or  Gu  is  already  in  the  upper 
real  Schur  form,  the  Lyapunov  equations  associated  with  gramians  can  be  solved  almost  immediately 
[B2,G3|. 

This  technique  can  be  implemented  very  efficiently  and  reliably.  It  does  not  increase  the  complexity 
of  the  software  since  the  balanced  realization  is  essential  to  solve  the  best  approximation  problem  (Section 
5.4).  The  testing  results  are  very  satisfactory. 
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6.3  Model  Reduction  In  H,^  Synthetic 

The  importance  of  model  reduction  in  control  system  design  has  long  been  recognized.  For  practical 
implementation,  it  is  desired  that  the  order  of  the  controller  can  be  reduced  in  a  way  such  that  the  con¬ 
trolled  system  still  satisfies  the  performance  requirements.  Typically,  there  are  two  ways  to  obtaiD  a 
lower  order  controller:  reducing  the  complexity  of  the  plant  model  and  using  model  reduction  in  the 
design  process  [E1,G2|.  This  section  considers  the  latter  issue. 

High-order  optimal  controllers  are  usually  derived  when  using  H <*,  optimization.  This  can  be  seen 
clearly  from  the  state-space  formula  shown  in  Section  5.5.  Therefore,  model  reduction  is  inevitable  from  a 
practical  point  of  view.  Recall  that  the  order  of  the  controller  K  =»  Ft(K,  Q  )  is  generically  equal  to  the 
order  of  K, ,  which  is  the  same  as  the  original  interconnection  structure  P ,  plus  the  order  of  Q  .  Thus  a 
natural  first  step  in  obtaining  reduced-order  controllers  is  to  consider  techniques  that  result  in  lower-order 
Q  s.  Another  candidate  for  model  reduction  is  the  R  in  the  GDP,  since  it  is  the  complexity  of  R  that 
affects  not  only  the  order  of  Q  but  also  the  computational  burden  involved  in  computing  Q  .  The  follow¬ 
ing  analysis  shows  how  the  model  reduction  can  be  performed  in  the  GDP  with  simple  i^-norm  bounds 
on  the  resulting  loss  of  performance. 

Assume  that  Q  lpt  is  the  optimal  solution  of  the  GDP: 

Q  ol  II 

hen  for  model  reduction,  one  has  the  following  two  results. 


7. 


min 
<?  e 


(i)  Model  reduction  on  R  : 


Suppose  R  is  a  reduced-order  model  of  R  ,  and  Q  tr,  is  the  optimal  solution  of 


.  min 

Q  €  f>Hx 


QO 


Define 


yr.- 


The  question  is  how  much  error  (i,  ■■  7, )  is  incurred  if  the  reduced  order  model  R  is  used  in  the 


GDP.  This  is  found  as  follows: 


7.  <  II*  - 


Q.„  0 

0  0 


<  *  - 


<  * 


<  *  - 


Q  ,Pt  0 

0  oj  *  . 

Q  0] 

0  0  +  II*  -  *  I  loo 

J  00 

Q  tpi  0 

0  0  I  I*  -  *  I  loo 

u  u  J  00 

Q.f,  ol  I 

0  oj +  L + 11*-* 

Q  spt  0 

0  0  +  2II*  -  *  I  loo 


-  7.  +  2||fl  -tf|L 
•••  if#  -7,  <  2||7?  -  /?  IU 
This  inequality  shows  that  the  error  is  no  more  than  2|| R  -  R  (la,. 


(ii)  Model  reduction  on  Q 


Suppose  that  Q  „„  is  a  reduced  order  model  of  the  optimal  solution  Q  ,pt  ■  Then 


Q'pp  0 

7.  <  *  -  00 


1  >pt  0  Q  apt 

0  0  * 


0  0 


<  *  - 


Q,p<  0 

0  0 


+  \\Q„t  -Q 


^  7a  ^  |  1 Q  opt  —  Q  tpp 


(6-2) 


Therefore,  model  reduction  on  Q  ,ft  will  introduce  an  error  of  no  more  than  |  |Q  ,fl  -  Q  w  1 

Suppose  that  model  reduction  in  the  synthesis  is  done  by  the  two  steps: 

(a)  find  the  reduced-order  model  R  and  the  solution,  Q  tf{  of  the  corresponding  GDP, 

(b)  find  the  reduced-order  model,  Q  ,  of  Q  ,ft . 

Then, 

Q  0 

*-  0  0  <  2||/f -AIL  ♦  \\Q',t -gwiu  (^3) 

J  00 

This  error  bound  can  be  derived  easily  by  combining  the  results  of  (i)  and  (ii)  above. 

The  above  result  is  very  encouraging  since  if  the  error  bound  in  Eq.  6-3  is  guaranteed  to  be  small  in 
model  reduction,  it  will  not  affect  the  performance  too  much.  Using  either  the  method  of  truncation  of 
the  balanced  realization  [E1,G2|  or  the  method  of  Hankel-norm  approximation  [G2],  the  reduced-order 
model  can  be  found  using  reliable  algorithms.  Furthermore,  both  methods  give  the  error  bounds  in  terms 
of  the  Loo-norm  which  are  computable  from  the  the  second  order  modes  of  the  given  system  (Section  5.4). 
A  more  detailed  treatment  on  this  subject  can  be  found  elsewhere  [E1,G2|.  Experience  to  date  has  shown 
that  in  many  practical  problems,  both  the  order  of  R  and  Q  ,,t  can  be  reduced  significantly  without 
incurring  too  much  error. 


APPENDIX  A 
LEMMAS 


This  appendix  includes  lemmas  which  were  used  in  proving  some  of  theorems. 


Lemma  A.l 


Assume  A  ,  B  €  L®,  then 


Proof 


(i)  Left  inequality: 


By  definition, 


IM  II. 

II  B  |L 


<  Si 


L  B, 

=»  SUp  ^^(AM 

<  sup  )  +  9*( B  )  j 

<  [iM  \\l  *  IM  Wl]* 


IM  ||.l 
II  B  |U  , 


(:i)  Right  inequality: 


A  IL  < 


and  1 1  B 


^VoVVoV^o\--,-VV,--.V.A-\A 
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IM  Hoc  II  b  |U 
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QED 


Ltrnmi  A.3 

Assume  A  =■>  |  -X-  T  j  €  £  *•  Then, 

T*T  -  7s/  -  A* A 

where 

U(I  -L*L)Vi  o' 

T  “  [-(S-l)*r*A’  5 

5  -  (r/  -  r«r)w 

and 

l  -  -  rr*)-^ 

Proof 

Instead  of  verifying  the  equality  directly  by  forming  the  product  T*T  which  is  just  the  routine 
algebra,  a  different  but  more  informative  approach  will  be  used  here. 

Recall  that  in  the  constant  positive  semi-definite  matrix  case,  a  Cholesky  factorization  can  be  found 


with  either  lower  or  upper  triangular  factors.  This  fact  can  be  extended  to  the  "block"  triangular  form. 
Therefore,  the  matrix  T  can  be  assumed  to  have  the  following  block  lower  triangular  form: 


WW  +  VV 

vs 

7 2I  -X'X 

-X*Y 

> 

s*v 

5*5 

“  -  Y*X 

7 2l  -  Y*  Y 

Therefore, 


WW  +  VV 
5*  V 
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7 21  -  A'*|/  +  (t2/  -  YY*)-'YY* \X 

7 2 1  -  72A'*(72/  -  YV)~lX 

7  V  -L-L) 


W  -  7 [1  -L  *L  )l 


The  following  two  lemmas  are  well-known. 


Lemma  A. 4  (Solution  of  Sylvester  Equations) 


Consider  the  Sylvester  equation 


AX  +  XB 


C 


(A- 1 ) 


where  A  6R'X",B  6  R"  ** ,  and  C  6  R"x"  are  given  matrices.  Then,  there  exists  a  unique  solu¬ 
tion  A  6  R'**  if  and  only  if 

Re(X,  (A  )  +  X;(B)j  ^  0,  V  i  =■  1, . n  and  j  =  1 . m  . 


a 


Remark 

In  particular,  if  B  =  A  r  ,  Eq.  A-l  is  called  the  "Lyapunov  Equation"  and  the  necessary  and 
sufficient  condition  lor  the  existence  of  a  unique  solution  will  be  that 

Re|X,  {A  )  +  \j  (A  )]  ^  0,  V  »',  j  =  1, . ,n  . 


Lemma  A.S  (Solution  of  Linear  Equations) 

Consider  the  linear  equation 

A  X  =  B 

where  A  €R‘X"  and  B  6R,X’  are  given  matrices. 

The  following  statements  are  equivalent: 

(i)  there  exists  a  solution  X  €  F"  *"  • 

(ii)  the  columns  of  B  €  Range  (A  )  . 

(iii)  rank  ^  A  B  j  =*  rant  ^  A  j  . 

(iv)  Ker  (A  r)  C  Ker  (flr)  . 


m 


APPENDIX  B 

ISOMORPHISM  BETWEEN 
THE  HALF-PLANE  AND  THE  UNIT  DISC 


Define  the  transformation 


«-l 

t  +1 


,  1+r . 

*  —  — 

l-Z 


(B-l) 


which  maps  the  right  half  plane  (Re(»)  >  0)  onto  the  unit  disc  ( |  z  |  <  1).  Therefore,  the  relation 
between  a  point  ju  on  the  imaginary  axis  and  the  corresponding  point  eji  on  the  unit  circle  is,  from  Eq. 
B-l, 


,  =  ju-l 
;'u+l 


Also  define  the  function 


V2 


(»+D 


This  yields  d  8 


w2+ 1 


du  =  -  |  i>(j  w)  |  2  du  . 


Let  T  be  the  unit  circle  in  z  -domain  and  /R  be  the  imaginary  axis  in  » -plane.  This  implies  that 
the  mapping 


/  -V-/  :  Wl  —  ffjO'R). 


where  /  (z ) 


/(*) 


_  l+£ 

1-J 


is  an  isomorphism.  Similarly, 


/  —  tf  :  H^T) i  -  ffj(,IRH 

is  an  isomorphism;  note  that  if  /  6  H2(T)L  ,  then  /  =  0  at  z  —  oo,  so  that  /  *  0  at  »  *  -1,  and 
hence  i>}  is  analytic  in  Re  *  <  0. 


APPENDIX  C 

OPERATIONS  ON  LINEAR  SYSTEMS 


This  appendix  describes  some  of  standard  operations  on  linear  systems  in  terms  of  transfer  functions 
and  their  realizations  which  were  used  in  previous  chapters. 


1.  Cascade 


Bx 

B, 

Ct 

Dx_ 

.  G2  “ 
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B  x 
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Ax  B xC 2 
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b2 

- 

B \C 2  A  i 

B\D2 

C  i  D  j  C  2 

DxD2 

0 1 C  2  C  \ 

D\D2 

Note:  This  realization  may  not  be  minimal. 


2.  Change  of  Variables 

x  —  z  -  Tx 

y  —  y  9i  Ry 

u  — *  u  ™  Pu 


A 

_B 

A_ 

1  B 

[r  ol 

'dJ 

5 
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|7T 

* 

o 

_ 1 

<f| 

0 
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Tj 

i 

TAT'1 

{ rap 

-l 

RCT 

|/?Z>P 

T 
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3.  State  Feedback 


4.  Output  Injection 


5.  Transpoee  (Dual) 


0.  Conjugate 
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